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ABSTRACT 

We develop a model for the physical origin and redshift evolution of spheroid scaling relations. We consider 
spheroid sizes, velocity dispersions, dynamical masses, profile shapes (Sersic indices), stellar and supermas- 
sive black hole masses, and their related scalings. Our approach combines advantages of prior observational 
constraints in halo occupation models and libraries of high-resolution hydrodynamic simulations of galaxy 
mergers. This allows us to separate the relative roles of dissipation, dry mergers, formation time, and evolu- 
tion in progenitor properties, and identify their impact on observed scalings at each redshift. We show that, at 
all redshifts, dissipation is the most important factor determining spheroid sizes and fundamental plane scal- 
ings, and can account (at z = 0) for the observed fundamental plane tilt and differences between observed disk 
and spheroid scaling relations. Because disks (spheroid progenitors) at high redshift have characteristically 
larger gas fractions, this predicts more dissipation in mergers, yielding systematically more compact, smaller 
spheroids. In detail, this gives rise to a mass-dependent evolution in the sizes of spheroids of a given mass, 
that agrees well with observations. This relates to a subtle weakening of the tilt of the early-type fundamental 
plane with redshift, important for a number of studies that assume a non-evolving stellar mass fundamental 
plane. This also predicts evolution in the black hole-host mass relations, towards more massive black holes at 
higher redshifts. Dry mergers are also significant, but only for large systems which form early - they originate 
as compact systems, but undergo a number of dry mergers (consistent with observations) such that they have 
sizes at any later observed redshift similar to systems of the same mass formed more recently. Most of the ob- 
served, compact high-redshift ellipticals will become the cores of present BCGs, and we show how their sizes, 
velocity dispersions, and black hole masses evolve to become consistent with observations. We also predict 
what fraction might survive intact from early formation, and identify their characteristic z = properties. We 
make predictions for residual correlations as well: e.g. the correlation of size and fundamental plane residuals 
with formation time of a given elliptical, that can be used as additional tests of these models. 
Subject headings: galaxies: elliptical and lenticular, cD — galaxies: evolution — galaxies: formation — 
galaxies: nuclei — galaxies: structure — cosmology: theory 



1. INTRODUCTION 

Understanding the scaling relations between the photo- 
metric and kinematic properties of galaxy spheroids - their 
masses, sizes, velocity dispersions, and luminosities - is fun- 
damental to explaining their origin. Any model which at- 
tempts to account for these correlations must, of course, also 
determine how they evolve as a function of redshift. In a 
ACDM universe, objects grow hierarchically, implying that 
ellipticals do not comprise a monolithic population formed at 
an early epoch, but have evolved and grown by mergers from 
z > 6 to z = 0. 

Observations are beginning to probe the populations of 
spheroids at these epochs, and reveal that there is indeed evo- 
lution in such s caling relations (see e.g . Truiillo et al. 2006; 
Mcintosh et"aL| 120051: IZirm et all 12007: van Dokkum et al. 



20081) . generally in the sense of massive elUpticals being 



more compact at high redshifts. Some of these systems, in 
fact, appear sufficiently compact (see e.g. IZirm et aP 120071 : 
Ivan Dokkum et al] |2008) that there may be no similar z = 
analogs whatsoever, leading to considerable debate regarding 
the fate of such objects. Ultimately, the causes of such evo- 
lution and its implication for e.g. the fundamental plane of 
ellipticals at those redshifts (and today) are not well under- 
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Stood. 

Moreover, observations now demonstrate that essentially all 
mass ive spheroids host a supermass ive black hole at their cen- 
ters dKormendy & Ric hstone"1 995h . The mass of this black 
hole is tightly c orrela ted with a variety of structural param- 
eters: the m ass (Magorrianet al. 1998), velocity dispersion 
jFerrarese & MeiTitt2000.;Gebhardt et al. 2000.). and concen- 
tratio n/Sersic index (iGrahamet aljr2001l : iGraham & Drived 
l2007h . 

Rec ently, iHopkins etal] (l2007d) and lAUer & Richstond 
(2007) demonstrated that these correlations could be under- 
stood in terms of a "fundamental plane"-like relation, es- 
sentially one where black hole (BH ) mass tracks t h e bind- 
ing energy of the stellar bulge. iHopkins et al.l (l2007bh 
showed that this is the natural expectation of theories where 
some form of feedback from accretion leads to self-regulated 
growth of the BH. Th is has important con sequences for 
e.g. quasar hghtcurve s (IHopkins et al.l l2006ah and Hfetimes 
(IHopkins et al.l2005bh . as well as for the effects of "feedback" 
on the host galaxy, possibly necessary for "quen ching" of star 
formation to create red elliptical galaxies at z = OCroton et alj 
2006; Hopkins et al. 2006b, 2007d, 20083). 

Observations suggest that there may be evolution i n the 
black hole host-galaxy scalings (e.g. iPeng et al.l l2 006i. but 
the details reniain a r nbiguous (see e.g . IWooe t al. 200^ 
Salviander et al. 2006; Lauer et al. '2007a"). As pointed out in 
Hopkins et a l. (2007b), at least some such evolution should 
be a natural consequence of evolution in host spheroid seal- 
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ing relations, provided that there is some fundamental quan- 
tity or property traced by black hole mass. Again, if true, this 
is of basic importance for models of black hole growth, AGN 
and quasar activity, star formation, and galaxy formation and 
evolution. 

A number of important questions therefore arise: If in- 
deed the observations are correct, why are these high-redshift 
spheroids different from their analogs at z = 0? What can 
this tell us about the process of spheroid formation? Can 
we use them to constrain and test models for spheroid for- 
mation, quenching, and black hole growth? Are such objects 
typical, or are they the result of complex selection effects? 
What happens to them by z = 0? Can we find the remnants or 
left-overs of the population today? Are evolution in spheroid 
structural properties related to evolution in their hosted black 
holes? And how is this connected to the cosmological process 
of quasar fueling and bla ck hole growth? 

In a series of papers, iHopkins et al.l (l2008ellallh|lbl) (here- 
after Paper I-Paper IV), we combined Ubraries of hydrody- 
namic simulations of galaxy merger remnants and observa- 
tions of nearby ellipticals spanning the largest available dy- 
namic range in e.g. spatial scale, surface brightness, and 
galaxy properties, and developed a methodology by which we 
could empirically separate spheroids into their two dominant 
physical components. First, a dissipationless component- i.e. 
an "envelope," formed from the violent relaxation/scattering 
of stars that are akeady present in merging stellar disks that 
contribute to the final remnant. Because disks are very ex- 
tended, with low phase-space density (and collisionless pro- 
cesses in a merger cannot raise this phase-space density), 
these stars will necessarily dominate the profile at large radii, 
hence the envelope, with a low central density. 

Second, a dissipational component - i.e. a dense central 
reUc of starbursts from gas that has been brought into the 
remnant, lost its angular momentum, and turned into stars 
in a compact central starburst similar to those observed in 
e.g. local ULIRGs (although these would represent the most 
extreme cases). Because gas can radiate, it can collapse to 
very high densities, and will dominate the profile within radii 
^ 0.5 - 1 kpc, accounting for the high central densities in el- 
lipticals. The gas will probably reflect that initially brought 
in from merging disks, but could in principle also stem from 
new coohng or stellar mass loss in the elliptical (see e.g. 
ICiotti & OstrikerlbOOTh . In subsequent mergers the two com- 
ponents will act dissipationlessly (they are just stars and dark 
matter), but the segregation between the two is sufficient that 
they remain distinct even after multiple major dissipationless 
or "dry" re-mergers: i.e. one can still, in principle, distinguish 
the dense central stellar component that is the remnant of the 
combined dissipational starburst(s) from the less dense outer 
envelope that is the remnant of low-density disk stars. 

In Paper I we developed a methodology to empirically sep- 
arate these components in observed systems with data of suf- 
ficient quality, and tes ted this on observed sampl es of nearby 
merger remnants from lRothberg & JosephI (l2004l) . Using e.g. 
comparison with stellar populations, colors, and other prop- 
erties as well as direct comparison of simulations with ob- 
served surface brightness profiles, galaxy shapes, and kine- 
matics, we confirmed that this could reliabl y extract the dis- 
sipati onal component of the galaxy (see also lKormendv et aLl 
l2008l and references therein). In Paper II we extended this 
to observed ellipticals with central "cusps" and in Paper III to 
those with central "cores" (believed to have undergone subse- 
quent dry mergers, but as we argue, still retaining evidence for 



the i nitial dissipational/diss ipati onless stellar comp onents), 
from lKormendyetal.1 (l2008h and lLauer et aH (l2007bl) . In Pa- 
per IV, we used these results to argue that the relative mass 
fraction in the dissipational component - i.e. the total mass 
fraction of the elliptical built up dissipationally, as opposed 
to brought in as stars in stellar disks, was the most important 
parameter determining the sizes and densities of ellipticals at 
z = 0. We showed that this fraction depends systematically on 
mass: low mass ellipticals experienced more dissipation than 
high-mass ones (presumably reflecting the well-established 
observational fact that low-mass disks are more gas-rich than 
higher-mass disks). We then empirically demonstrated that 
this systematic dependence is both necessary and sufficient to 
explain e.g. the difference between the size-mass relation of 
ellipticals and that of their (ultimate) progenitor disks, and to 
explain the observed "tilt" in the fundamental plane relation 
of spheroids. 

There are, however, limitations in our ability to theoreti- 
cally model these processes using idealized simulations and 
to understand them observing only z = ellipticals. Many 
galaxies are expected to have had complex merger histories 
(perhaps forming early in a rapid series of multiple, highly 
dissipational mergers, rather than a single disk-disk merger 
at late times), which might, in principle, "smear our" or bias 
some of these comparisons or systematically alter some pre- 
dictions (see e.g. Kobayashi 2004; Naab et al. 2007). More- 
over, the gas content in pre-merger disks (and possibly in 
ellipticals them selves) is both expected and observed (e.g. 
lErb et al.]l2006h to evolve strongly as a function of redshift. 
The disks building up ellipticals at z > 2 are much more gas 
rich, so their remnant ellipticals at these times would neces- 
sarily be more dissipational, and as such might have different 
structural properties and obey different scaling relations from 
their z = counterparts. It is therefore of particular interest 
to combine our empirical understanding of the role of dissi- 
pation from local samples, and our inferences from idealized 
simulations, with fully cosmological models that can account 
for the evolution in elliptical progenitors and merger histories 
with redshift. 

Some initial attempts at studying these correlations have 
been made using cosmological simulations JNaab et al. 20071 
lDiMatteoetal.1120081: ISiiacki et al.ll2007t ICroftetal.1 12008). 
It is, unfortunately, prohibitively expensive to simultaneously 
resolve the ^ lOOpc scales necessary to meaningfully pre- 
dict e.g. spheroid scaling lengths, black hole masses, and cen- 
tral velocity dispersions in a fully cosmological simulation 
which would also contain a volume capable of modeling a 
large number of galaxies up to masses ^ IO'^Mq. More- 
over, disk formation remains an unsolved problem in such 
simulations, with the uncertain physics of e.g. star formation 
and feedback probably important to form disks with the ap- 
propriate scale l engths, thickne sses, and bul ge-to-disk ratios 
for their masses j Roberts on et al..2004.;.D'Onghia et al.ll2006l; 
Governatoet"an [2007: Ce verino & Klypinll2007l: IZavala et all 



2007h . If initial disks do not have the appropriate observed 



structural properties, then even a perfect model for spheroid 
formation will have severe systematic inaccuracies. 

In order to get around some of these limitations, alter- 
native attempts have been made to instead predict scaling 



resolutions from semi-analyti c models of g alaxy formation 
(iKhochfar & Silk 2006; Almei da et 31.1120071) . Although they 
could not calibrate the details of their predictions with nu- 



tnei r pr 
merical simulations, Khochfar & SiUd (12006) used this to il- 
lustrate, with very simple assumptions, that increasing dis- 
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sipational fractions in ellipticals at higher redshift leads to 
the expectation that spheroids should be more compact. 
lAJmeida et alj (l2007h explicitly ignored dissipation, and as a 
result found that none of the models they considered could re- 
produce the observed z = fundamental plane correlations of 
spheroids (nor their redshift evolution). 

Such models, however, potentially suffer from the same 
problems regarding the nature of disks (the "initial condi- 
tions"): uncertainties related to modeling spheroid formation 
are difficult to disentangle from prescriptions for e.g. cooling, 
disk growth, star formation, and feedback. Moreover, while 
a few properties can easily be estimated (in a mean sense) 
from a priori analytic arguments, many important properties 
and physical factors affecting spheroid evolution - the role 
of dissipation in changing the shape and velocity structure of 
a spheroid, the detailed density profiles (especially at small 
radii and in extended envelopes) of ellipticals, and the dark 
matter to stellar or baryonic mass ratios within various radii in 
the galaxy - require detailed, high-resolution numerical sim- 
ulations to robustly model. 

One way of circumventing the problems in these cosmolog- 
ical models is to adopt a halo occupation distribution (HOD) 
approach, in which (beginning from a haloH-subhalo distribu- 
tion determined in numerical simulations) halos are populated 
with galaxies according to empirical constraints on their clus- 
tering properties. The success of such models at reproducing 
the observed distributions at a variety of redshifts, as a func- 
tion of v arious galaxy proper ties, has now been demonstrated 
(see e.g. IConroy et alJl2006h . This allows us to begin with 
the empirical knowledge of where galaxies lie, and what their 
gas properties are, without reference to some (potentially in- 
complete) a priori model of e.g. star formation and feedback 
(which we are not, in any case, interested in testing here). Us- 
ing this approach to identify and follow galaxies and their sub- 
sequent mergers, we can then reference each merger to an ide- 
alized hydrodynamic simulation with similar properties (e.g. 
initial morphologies, gas fractions, sizes, masses, etc.), and 
use this to predict what the properties of the remnant should 
be. Constructing a sample of such remnant spheroids at any 
given redshift is straightforward, and allows us to compare 
with observed scaling relations in a direct manner 

In this paper, we adopt this method to combine the advan- 
tages of high-resolution simulations, semi-analytic models, 
and observational constraints on spheroid progenitors in or- 
der to develop robust predictions for the evolution of spheroid 
structure and the correlation between black hole mass and 
host properties. We study how e.g. the evolution in disk gas 
fractions, structural properties, and the interplay between gas- 
rich and gas-poor mergers and multiple mergers in a hierarchi- 
cal cosmology relate to observed correlations from z = 0-4. 

In §|2] we describe our methodolog y, in cluding our cosmo- 
logical models of merger histories (§ I2.2l i. empirical models 
for progenitor disks (§ 12.3b . and library of simulations which 
we use to determine remnant properties (§ \2A\ . In § [3] we 
consider and compare observations to the resulting predicted 
scalings of spheroids at z = 0, and specifically highlight the 
effects of dissipation (§ 13. lb and dry mergers (§ 13.2b . In §|4] 
we make a number of predictions and comparisons to obser- 
vations regarding how these correlations evolve with redshift, 
and again illustrate the effects of dissipation (§ 14.2b and dry 
mergers (§ 14.3b . In § |5] we follow the history of individ- 
ual systems forming at different redshifts to compare them at 
later redshifts, and discuss the fates of early-forming galaxies 
which (initially) reflect these evolved correlations. Finally, in 



§|5] we summarize our results and outline future observational 
tests. 

Throughout, we adopt a WMAP5 (r^M, ^A, /j, erg, n,) = 
(0.274, 0.726, . 705, .812, 0.960) cosmology 

(Komatsu e t al.l l2008l) . and normalize all observations 
and models shown to these parameters. Although the exact 
choice of cosmology may systematically shift the associated 
halo masses (and high-redshift evolution) at a given galaxy 
mass (primarily scaling with a^), our comparisons (in terms 
of stellar mass) are for the most part unchanged (many 
of these differences are implicitly normalized out in the 
halo occupation approach). Repeating our calculations 
for a "concordance" (0.3,0.7,0.7,0.9,1.0) cosmology or 
the WMAPl (0.27,0.73,0.71,0.84,0.96) a nd WMAP 3 
(0.268, 0.73 2, 0.704, 0.776 , 0.947 ) results of 'SpergeletaD 
(2003) and ISpergel et aP (l2007l) has little effect on our 
conclusions. We also adopt a ISalpeteiJ d 19641) stellar initial 
mass function (IMF), and convert all stellar masses and 
mass-to-light ratios accordingly. Again, the choice of the 
IMF systematically shifts the normalization of stellar masses 
herein, but does not substantially change our comparisons. 
All magnitudes are in the Vega system, unless otherwise 
specified. 

2. THE SIMULATIONS AND COSMOLOGICAL MODEL 

2.L Overview 

The model we will adopt consists of the following steps, 
summarized here and described in detail below: 

• We construct the haloH-subhalo mass function, and pop- 
ulate this with galaxies of various masses according to 
a standard halo occupation model approach. We allow 
this population to vary within the range allowed by ob- 
servational constraints, but find this to be a small source 
of uncertainty. 

• We assign un-merged galaxies a scale length and gas 
fraction appropriate for their stellar mass and redshift 
according to observational constraints (with appropri- 
ate scatter). We also systematically vary assignments 
to span the range allowed by observational constraints, 
and find this is the dominant source of uncertainty in 
our predictions. 

• We evolve this model forward in time (following the 
dark matter) and identify mergers. After a merger, the 
properties of the remnant galaxy are calculated as a 
function of the progenitor properties, according to the 
results from hydrodynamic simulations. 

• At some later time, we construct a mock catalogue (uni- 
formly sampling the mock population in stellar mass) 
and compare the predictions with observed galaxy 
properties. 

The exact implementation of the halo occupation model 
and merger rate calculations are outlined in the Appendix, 
for readers interested in reproducing the model calculations. 
Here we outline the relevant physics and model approach, and 
highlight the calibrations from simulations used to predict 
the properties of merger remnants, as well as the important 
sources of uncertainty for those predictions. 

2.2. Cosmological Model 
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We wish to track merger histories in a cosmologically mo- 
tivated manner, in order to follow the growth of spheroids. As 
discussed in §[T] it remains prohibitive to simulate the relevant 
hydrodynamic processes (with the necessary < lOOpc reso- 
lution) in full cosmological simulations (where, in order to 
probe the massive galaxies of interest here, we would require 
~ Gpc^ box sizes). In principle, we could begin with a full 
semi-analytic model: attempt to describe, in an a priori man- 
ner, the entire process of cooling, dis k growth, star formation , 
and feedback. This has been done bv Khochf ar & SilM (120061) . 
for example, as well as Almeida et al. (20071) . and we com- 
pare with their results below. We have also e xperimented with 
this me thodology applied to the approach of lSomerville et all 
(l2008ah . and find qualitatively similar results. 

However, a full semi-analytic model necessitates an attempt 
to predict e.g. the star formation histories, gas fractions, disk 
galaxy populations, and clustering of galaxies in an a priori 
manner Although these are certainly worthy of investigation, 
they are not the quantities we are interested in here - therefore 
attempting to model e.g. disk formation, star formation, and 
stellar winds in this manner will introduce additional uncer- 
tainties and dep endences. 

For example, iKhochfar & Silk! (l2006l) find that their most 
massive galaxies are predicted to evolve very rapidly. How- 
ever, their analysis results in more rapid gas fraction evolution 
than observed, a problem comm on to semi-analytic mo dels 
without strong feedback (see e.g. ISomerville et ani2001h . In 
fact, over the redshift range z ~ 0- 2, progenitor disk galaxy 
properties (at least the broad properties of greatest interest to 
us here, namely their sizes (or the Tully-Fisher relation) and 
gas fractions) are reasonably well constrained empirically, but 
theory has had mixed success at best in reproducing them 
from a fully a priori framework. If current a priori models 
are incorrectly describing the cooling of gas and disk forma- 
tion (in particular the formation of massive galaxies), then the 
predicted properties of ellipticals formed in mergers of those 
progenitors are suspect. 

We therefore adopt a halo occupation approach - essen- 
tially beginning with the observational constraints on e.g. disk 
gas fractions and the Tully-Fisher relation, and predicting the 
properties of spheroids from these empirically constrained 
progenitors. For our purposes, we are not concerned with why 
a given disk population is e.g. gas-rich or gas-poor, or has a 
particular effective radius - we need those numbers to com- 
pute the properties of the remnant of a major merger of such 
disks. This allows us to be as conservative as possible, and to 
identify the robust prediction for spheroid scaling laws with- 
out reference to uncertainties in e.g. disk formation models. 
We can (and do) include the uncertainties in the observational 
constraints in our modeling, but we find that these are unim- 
portant. 

The detail s of our schern e are de scribed in iHopkins et al.l 
(I2008J) and iHopkins et all (l2008d) . but we briefly review 
them here. For a particular cosmology, we identify the halo 
and subhalo populations, and associate each main halo and 
subhalo with a galaxy in a Monte Carlo fashion. Specifi- 
cally: at a given redshift, we calculate the halo mass f unction 
w(Mhai n) for our adopted cosmology following Sheth et alJ 
(1200 lb . For each halo, we calculate the (weakly mass and 
redshift dependent) subhalo mass function (or distribution of 
subhalos, f Wsubhaio iMsubhai o ; Mhai ol) following lZentuer et al] 
dlOOS) and iKravtsov et a l." (2004). Alternative ly, we have 
adopte d it directly from Gao et al. (20041): iNurmietalJ 
(120061) or calculated it foUowing lvan den Bosch et alJ (120051) : 



IVale & Ostrikerl (|2006|) . and obtain similar results: the un- 
certainties at this stage are negligible. Note that the subhalo 
masses are defined as the masses upon accretion by the parent 
halo, which makes them a good proxy for the hosted galaxy 
mass ( Conroy et alJl2006h and removes the uncertainties ow- 
ing to tidal stripping. 

We then populate the central galaxies in each halo and sub- 
halo according to an empirical halo occupation model. These 
empirical models determine both the mean stellar mass and 
dispersion in stellar masses of galaxies hosted by a given 
halo/subhalo mass P(Mgai | Msubhaio), from fitting the observed 
galaxy correlation functions as a function of e.g. scale and 
stellar mass or luminosity at a given redshift. Other proper- 
ties, such as e.g. the sizes and gas fractions of galaxies, are 
determined as a function of their stellar mass (and, according 
to our adopted model, merger history) as described in § 12.31 
l2A2l below. 

Although such models are constrained, by definition, to re- 
produce the mean properties of the halos occupied by galax- 
ies of a given mass/luminosity, there are known degenera- 
cies between parameterizations that give rise to some (usu- 
ally small) differences between models. We therefore re- 
peat all our calculations for our "defaul t" model (for which 
we follo w the empirical const raints in IConrov et al.l l2006l) 
(see also IVale & Ostriker 2006) and an alternate halo oc- 
cupation model (Y ang et al. 2003) (see also Yan et al. 20031; 
IZheng et aP l2005h . which bracke t the range of a number 
of calculations (e.g ICooravll20 05l. 120061; 'Z heng et al.1 120051: 
Ivan den Bosch et all 120071: iBrow n et al. 2008) and direct ob- 
servations of groups ( IWang et al.ll2006l) . 

We have also compared a variety of prescriptions for the 
redshift evolution of various components in the halo occupa- 
tion model: we have adopted that directly fitted by the au- 
thors above at various redshifts, we hav e considered a com- 
plete re-derivation of the HOD models of Conrov e t al.l ( l2006h 
and Vale & Ostriker (2006) at different redshifts fro m eac h 
of the the observed mass functio ns of J Fontana et al. (2006); 
iBund v et all (l2005h: jBorch et alj ( l2006l) : 31anton (.2006.) (see 



Hopki ns et al.ll2008fl) . and have also found similar results as- 
suming no evolution in P(Mgai |Msubhaio) (for star forming 
galaxies). Indeed, a number of recent studies suggest that 
there is very little evolution in halo occupation parameters 
(in terms of mass, or relative to Ly) with redshift (|Yan et alJ 
I2OOI ICooravll200l IConrov et alJl2006l: iBrown et ailJSool y 
or equivalently that the masses of galaxies hosted in a halo 
of a given mass are primarily a function of that halo mass, 
not of redshift (iHeymans et al.1 l2006t IConroy et all [2007). 
This appears to be especially true for star-forming galaxies 
dConrov et alJl2007F) . unsurprising given that quenching is not 
strongly operating in those systems to change their mass-to- 
light ratios, but it also app ears to be true for red galaxies at 
least at moderate redshifts dBrown et alj |2008). 

Consequently, the differences between any of these choices 
are small (at least at z < 3), and negligible compared to the 
uncertainties in our predictions from e.g. the uncertainties 
in disk sizes and gas fractions as a function of their stellar 
mass and redshift. Systems which have not undergone a ma- 
jor merger in the model are assumed to be disk-dominated, 
and we can populate them either according to a halo occupa- 
tion model that treats all galaxies in an identical fashion, or 
as blue/star forming galaxies in a model where the constraints 
are derived for blue and red galaxies separately (we find it 
makes little or no difference). 

Finally, having populated a given halo and its subhalos 
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with galaxies, we follow the evolution of those halos forward 
and identify major mergers^. The details of the treatment of 
subhalo-sub halo and subhalo-halo mergers are described in 
iHopkins et al. (2008f) and Hopkins ^etal] ( l2008d) . but in short 
the halo merger timescale is straightforward and well defined. 
When two subhalos are fully merged, we can either assume 
the galaxies have merged, or allow for some additional merger 
timescale within the merged halo. We have experimented with 
a variety of models for this, including e.g. the dynamical fric- 
tion timescale and alternative timescales calibrated from sim- 
ulations or calculated based on group capture or collisional 
cross section estimates and angular momentum (orbital cross 
section) capture estimates. But because these merger times 
within a halo are always much less than the Hubble time at 
the relevant redshifts, and we are interested only in statistical 
properties across populations evolving over a Hubble time, it 
makes no difference to our calculations what choice we adopt. 
When galaxies merge, we estimate the properties of the 
remnant based on those of the progenitor galaxies, accord- 
ing to our numerical simulations, as described in § 12.41 This 
approach yields reasonable agreement with global quantities 
such as the spheroid mass function, the mass density of el- 
lipticals as a function of mass, and the fraction of ea rly-type 
galaxies as a function of mass (Ho pkins et al.ll2008cl) . Again, 
despite the assumptions involved thus far, we stress that the 
detailed choice of halo occupation model yields negligible 
differences in progenitor galaxy properties and in our pre- 
dictions at all the masses and redshifts of interest, where the 
clustering and abundances of massive galaxies are reasonably 
well-constrained. 

2.3. Progenitors: Laying Galaxies Down 

Given a host halo mass and disk stellar mass, both deter- 
mined from the halo occupation model, the two important 
parameters we must assign to each disk are a size (effective 
radius R^) and gas fraction (/gas = A^gas/(A^*+A/gas))-'' This 
is where the dominant uncertainties in our modeling arise - 
we will show that e.g. the gas fraction in pre-merger disks is 
an important parameter determining the characteristics of the 
remnant, so we can only predict those properties to within the 
uncertainties in disk gas fractions at the appropriate redshifts. 

To minimize the uncertainties in modeling this, we there- 
fore begin with an empirical estimate of the gas fraction dis- 
tribution in spiral galaxies as a function of mass and redshift. 
At z = 0, there is no significant uncertainty; various measure- 
ments (e.g. Kennicutt 1 998; B ell & de Jo ng 2000; Kannappan 
l2004t iMcGaugh et all 120001 iMcGaughl IMBh and indirect 
constrain ts from star forma t ion and the baryonic TuUy-Fisher 
relatio n (iBell et al.l l2003at iNoeske et all 120071; ICalura et al.1 
l2007h give consistent estimates of the median gas fraction in 
spirals as a function of their stellar mass, and the scatter at 

^ In a major merger, tidal forces are sufficiently strong to drive nuclear 
inflows of gas and build realistic spheroids. The precise meaning of ma- 
jor merger in this context is blurred by a degeneracy between the progen- 
itor mass ratio and the orbit (Hernquist 1989; Hernquist & Mihos 1995; 
l^urnaud et al. 200 j), but both numerical I Younger et al. 2008) and obser- 
vational <Dasvra et a l. 2006; Wo ods et alJ2006h studies indicate that massive 
inflows of gas and morphological transformation are typical for mass ratios 
only below ~ 3 ; 1 . This is ultimately related to the non-linear scaling of the 
disk response to a merger of a given mass ratio (Hopkins et al. 2008g). Un- 
less otherwise noted, we generally take the term "mergers" to refer to major 
mergers. 

■* Note that together the disk size, baryonic and halo mass define another 
parameter of interest, the disk circular velocity or maximum velocity; but this 
does not explicitly enter into the way in which we calculate the properties of 
the remnant. 



each mass. We obtain identical results if we use any of these 
constraints (or the data points themselves), but note for con- 
venience that all of these observations can be well-fitted by 
the trend 



(/gas(z=0)) 



1 



1 + (M,/10''15Mq)0-4 



(1) 



with a constant '^ 0.2- 0.25 dex scatter at each mass. 

Similar, albeit less robust, estimates exist at z = 1 and z = 2 
(see e.g. IShaplev et al.l 120051; lErb et al.l 12006 , respectively). 
We therefore adopt a simple functional form that interpolates 
between these measurements. Motivated by the power-law 
form of the observed Kennicutt-Schmidt star formation rela- 
tion (M, ex Sga^), we adopt the following interpolation for- 
mula: 



(/gas(z)> = /gas(z = 0) 1 + Tlb(z) d "/gasfe = 0)'^) 



-2/3 



(2) 



where (3 sa 3/2 and tlb(z) is the fractional lookback time to 
redshift z- This can be derived, for example, by assuming a 
simple model where the star formation rate scales according 
to the observed relation, the disk scale length varies as a power 
of the baryonic mass, and the net mass accretion rate (inflows 
minus outflows) is also a power-law function of the mass of 
the system; given the requirement that the system begin at an 
initial time f = with /gas = 1 and have the observed /gas at 
z = 0. 

In any case, the v alues above provide a goo d fit to the obser- 
vations at z = - 2 (iBell & de J ong 2000; Sha plev et al.]|2005l; 
lErb et al.) 12006) and a reasonable approximation to results 
of cq smological simulations ( Keres et al. 2005; Keres et al] 
l2007h and semi-analytic models JSomerville et al.l l2008al) . 
and interpolate smoothly between z = and high redshift. 
Lowering /? systematically weakens the implied redshift evo- 
lution in /gas - in order to represent the uncertainty in the 
observations, we consider a range (3 ^ 0.5-2.0 (bracketing 
the observational errors). Similar prescriptions for /gas evo- 
lution can be obtained from various toy model star forma- 
tion histories includi ng e.g. commonly adopted exponentia l 
T-models (see e.g. B ell & de Jongll2000l: iNoeske et al1l2007h 
and integration of specific tau models enforcing proportion- 
ality between inflow, star formation, and out flow rates (|Erb| 
2008); these are all discussed in more detail in lHopkins et al.l 
(2007b, 2008f ), but they generally yield similar results and lie 
within the uncertainties we consider (differences being much 
less than the typical observed scatter in /gas at each mass and 
redshift). 

The other important property we must estimate for pro- 
genitor disks is their effective radius. At z = 0, this is 
well-measure d for all di sk masses of interest. We employ 
the fits from IShen et all |2003) to late-type galaxy sizes as 
a function of steUar mass (from < IO'^Mq to > IO'^Mq), 
and the scatter in those sizes. Adopting different local es- 
timates or even completely ignoring this scatter makes lit- 
tle difference. Observations are somewhat more ambigu- 
ous regarding the redshift evolution of sizes, but fortu- 
nately the o bservatio nal constraints (e. g. Truiillo et al. 2006; 
Barden et al. 2005; Ravindranath et al.1^004: .Rix et al...2004: 



Ferguson et al. 2004; lAkivama et al. 20()a) and state-of-the- 
art models (Somerville et al.ll2008b) suggest that any evolu- 
tion is relatively weak. This is also reflected in e.g. the bary- 
onic Tully-Fisher r elation, which appear s to evolve n egligi- 
bly from z = - 2 (IConselice et al.ll200l iFlores et all 120061; 
iKassin et al.|[2007l;lvan Dokkum et al.ll2004" 
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We therefore consider two extremes. In the first case, disk 
sizes and the baryonic Tully-Fisher relation are completely 
fixed with redshift, according to their z = values. In the sec- 
ond, disk sizes evolve according to observational estimates 
from the various measurements above. A simple power law 

ReM^M, |Z] = (1 +z)-'^'ReMA[M. \z = 0] (3) 

where (3d k, 0.4 provides a good fit to the observations and is 
consistent with all of the measurements, spanning the redshift 
range z = 0-3. We adopt this estimate, but obtain a simi- 
lar result if we use the more detailed cosmologically moti- 
vated model in ISomerville et al . (2008b), which allows disks 
to fo rm conser ving specific angular momentum from halo gas 
(see iMo et al.l 11998), yielding (approximately) Rj oc /^vii/c, 
where c ex 1/(1 +z) is the halo concentration - although ^vh 
is smaller for halos of a given mass at high redshift, they 
are less concentrate d, yielding a weak size evolution simi- 
lar to that observ e d ^B ullock et al.l200l UWechsler e t al.l2002t 
Neto et all 120071: [Comerford & NataraianI 120071: iBuote et al. 



2007ir 



Together this gives us an estimate of our input disk pa- 
rameters, and we allow for the described range in both the 
estimated gas fraction and disk size evolution. At low red- 
shift, this introduces little uncertainty (most spheroids at z = 
have assembled relatively recently, so their structural proper- 
ties reflect those of low-redshift disk progenitors, where the 
uncertainties are minimal and the relevant sizes, etc. can be 
directly measured from observations). However, these un- 
certainties begin to grow rapidly at z > 2 - 3, where there 
are no more direct observational constraints. Moreover, at 
these redshifts, it is not clear that "un-merged" galaxies will 
necessarily be disks analogous to those well-understood at 
z < 2 (see e.g. the cl umpy morphologie s and increased dis- 
persions observed in iReddy et aP l2006t iFlores et aH 120061: 
iBournaud et al.l l2008 l). although higher-resolution observa- 
tions suggest that at least a significant fraction of this popula- 
tion does exhibit regular rotation and smoother morphologies 
(albeit most Hkely still puffier and more dispersio n-supported 
than low-redshift disks) in the rest-frame optical (iPuech et aL 



20071; lAkivama et al.ll2008l; iGenzel et al]|2008t IShapiro et al 



20081). 

In either case, the uncertainties as this limit is approached 
can be considered part of the uncertainty in e.g. progenitor 
sizes and structural properties, as observational estimates of 
this quantity often do not make strict morphological cuts at 
these redshifts but rather assign gas-rich star-forming systems 
to a single category (and this is how such uncertainties would 
enter into our model). In this aspect, the uncertainties from as- 
signing them implicit "disk" properties may actually be less 
than one might expect, because at the point where a galaxy is 
very gas-rich (increasingly common at these high redshifts), 
it does not matter what initial configuration this gas is in 
(whether e.g. clumpy, filamentary, or infalling in direct col- 
lapse), as it is dissipational and in any case will (in mergers) 
lose angular momentum and form a dense central starburst. 
In any case, these uncertainties lead us to limit our predic- 
tions to z < 4. Fortunately, because at any lower redshift most 
spheroids have assembled relatively recently, with decreasing 
mass fractions contributed by galaxies assembled at very early 
times, the uncertainties associated with the lack of constraints 
at these high redshifts rapidly become less important to our 
predictions. 

2.4. The Simulations 



In order to determine the effects that a given merger will 
have on galaxy properties, we require some estimates of how 
e.g. the gas content of initial galaxies (and other properties) 
translates into properties of the remnant. We derive these 
prescriptions from a large library of hydrodynamic simula- 
tions o f galaxy encounter s and mergers, des c ribed in detail 
in Ro bertson et al.1 (l2006l) : ICox et all (l2006h : lYounger et all 
(2008) and Paper I. These amount to several hundred unique 
simulations, spanning a wide range in progenitor galaxy 
masses, gas fractions, orbital parameters, progenitor struc- 
tural properties (sizes, concentrations, bulge-to-disk ratios), 
and redshift. 

Most of the simulations are major (mass ratios ~ 1 : 3- 
1:1), binary encounters between stellar disks (which pro- 
vide an idealized scenario useful for calibrating how de- 
tails of the remnant depend on specific progenitor prop- 
erties), but they include a series of minor mergers (from 
mass ratios ~ 1 : 20 to ~ 1 : 3), as well as spheroid- 
spheroid "re-mergers" or "dry mergers" (i.e. mergers of the 
elliptical remnants of previous merger simulatio ns), mixed- 
morp h ology (spiral-elliptical) mergers (see also B urkert et al.l 
120071; [Johansson et al.l l2008l for a detailed study of these 
mergers), multiple mergers, and rapid series of hierarchi- 
cal mergers. Our adopted prescriptions are robust to these 
choices. The simulations usually include accretion and feed- 
back from supermassive black holes, as well as feedback from 
supernovae and stellar winds. However, we have performed 
parameter studies in these feedback prescriptions, and find 
that the structural propertie s of interest here are relatively 
insensitive to these effects (ICox et al.l 120081; iHopkins et al.l 
2007b, 2008e). 

The predicted simulation scalings are discussed in de- 
tail in Paper I-Paper III, where we take advantage of 
high-resolution observatio ns of local merger remnants (from 
iRothberg & Joseph! |2004|) as well a s both nuclear cus p and 
core ellipticals (from iLauer et all 2007b; Kormend y et al.l 
l2008l) to test them observationally. We find good agreement 
between the predicted and observed scalings with e.g. stellar 
mass and gas content (of the progenitor galaxies) at the time 
of the merger If, as is commonly believed, core ellipticals are 
the product of spheroid-spheroid re-mergers, then our com- 
parisons in Paper III also confirm the scalings derived from 
simulated re-mergers. 

2.4.1. Mergers without Gas 

In mergers, the stars and dark matter form a coUisionless 
system, and hence gas free (dissipationless or dry) mergers 
are particularly easy to handle. Furthermore, direct compar- 
ison with our simulations suggests that, to the desired accu- 
racy, it is a good approximation to treat the dissipational and 
dissipationless components of a merger separately, allowing 
us to apply simple rules to the stellar remnant in mergers of 
both gas-rich disks and gas-poor spheroids. 

It is well established in numerical simulations that, to 
lowest order, the effect of any merger of dissipationless 
components is to "puff them up" by a uniform factor, 
while roughly conserving profile shape (at least for cer- 
tain "e q uilibrium" profile shap es s uch as the iNavar ro et al.l 
(119961) . iHernauisJ (|1990[) or IdTV aucouleurs (1948 ) pro- 
files ; see e. g. 'Barnes'' 1991 iBovlan-Kolchin et al.ii2005i. 120061: 
Hopkins et al. 2008h). The conservation of both energy and 
phase space density means that, modulo a small normalization 
offset owing to possibly different profile shapes the remnant 
of e.g. two initial stellar disk or spheroid mergers wiU have a 
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similar final effective radius. 

In detail, if we temporarily ignore the halos of the galax- 
ies (a good approximation in most simulations, since the spe- 
cific binding energy of the galaxy baryonic mass is much 
larger than that of the halo) and consider isotropic systems, 
we obtain for the parabolic merger of systems of mass Mi and 
M2 = f Ml (f < 1) the energy conservation equation 



Ef = kf (Ml + M2) CT/ = £■; = /ti Ml CT^ + k2 M2 aj 



(4) 



where A; is a constant that depends wea kly on profile shape 
(for greater detail, see ICiotti et al.ll2007l) . If profile shape is 
roughly preserved and R oc M/a^, then we expect 



Rf = R\ 



(l+ff 



(i+Z^i;) 



(5) 



A merger of two perfectly identical spheroids will double R^ 
and conserve a. It is straightforward to solve the appropriate 
energy conservation equation numerically, allowing for arbi- 
trary profile shapes and for halo components (assuming the 
halo profile follows Navarro et al. (1996) or Hernquist ( 199C3) 
profiles), but in practice we find that these subtleties make no 
almost no difference compared to using Equation (|5]). 

In a merger of two stellar disks, violent relaxation will 
transform the stellar distribution from initially exponential 
disks into de Vaucouleurs (1948)-like profiles, more gener- 
aUy lSersid (Il968h profiles (of the form /, ex exp[-(r/ro)'/"'] 
with ris ~ 2.5 - 3.5 after a single maj or disk-disk merger 
(see Paper I-Paper II and iNaab & Truii llo 2006), while the 
halo profile shape is approximately preserved (Paper IV, 
iBoylan-Kolchin et al. 2005). These trends in simulations can 
both be roughly explained by allowing the merger to scatter 
stars and dark matter (in addition to uniformly puffing up the 
profile) in their final three-dimensional radii by some lognor- 
mal broadening factor ar ~ 0.3-0.4dex (in a 1 : 1 merger). 
In other words, while the median final (post-merger) effective 
radius of stars at some initial radius r, is given by the uni- 
form puffing up or stretching of the profile in Equation (|5]l, 
there is a lognormal scatter with dispersion <Tr in the final 
radii of stars from that initial radius. Over the relevant dy- 
namic range for most practical observational purposes (from 
^ O.QIR^ to > 10 Re, or as much as 15 magnitudes in surface 
brightness), this will effectively transform an initial exponen- 
tial profile into an n^ « 3 (ct^ = 0.3) or n., = 4 {ar = 0.4) profile. 

Subsequent spheroid-spheroid or spheroid-disk mergers 
continue to scatter stars out to larger radii, building an ex- 
tended envelope of material, and raising the best-fit Sersic 
index of the dissipationless component of the remnant. We 
study this in Paper III, and find to a good approximation that 
a re-merger of mass ratio / < 1 will raise the «j of the dissi- 
pationless component by An, ~ /, or equivalently "broaden" 
the three-dimensional stellar distribution by a lognormal fac- 
tor ^ 0.3/ (the prescription we adopt). At z = we can then 
determine an "observed" Sersic index by projecting this mock 
profile (with that of the dissipational component) and fitting 
to the system. If we consider the Sersic indices directly fitted 
to our simulations - i.e. draw a Sersic index for a disk-disk 
merger remnant randomly from that fitted to our simulated 
mergers of the same mass ratio, and so on - we obtain similar 
predictions. 

Finally, having the profile of the dissipationless stellar and 
halo components, we can calculate the velocity dispersion a 
of stars within some radius (here we take the mass-weighted 
velocity dispersion of stars within Re, and assume isotropic or- 



bits, although this is not a dominant uncertainty in our predic- 
tions). We can instead (without any reference to the isotropy 
assumption) assume that the dynamical mass estimator, a^Re, 
is a good tracer of the true enclosed total mass within the stel- 
lar effective radius Re (modulo a normalization constant that 
is the same for most ellipticals) - a fact seen in our simu- 
lations (Paper IV) and o bservations (ICappellari et al.l 120061: 
iBolton et all l2007l l2008h - and use the known total mass 
(M(< Re)) to predict a. The results are similar. 

We note that halos are tracked with stars in these processes, 
but between mergers, the halos still grow. We ass ume at all 
times that the dark matter halo follows a iHernqu ist (199Ci) 
profile (essentially identical to an'Navar ro et al.l ([1996) profile 
at the radii that matter for the baryonic galaxy, but with finite 
mass and analytically tractable properties, a more useful ap- 
proximation since we are considering halos as if in isolation), 
with a concentration from the mean concentration-halo mass- 
red shift relation in simu lations and observations (see § 12.31 
and lSpringel et al.l (l2005h ). We then calculate the halo contri- 
bution to quantities such as e.g. the dynamical mass within Re 
and (J based on this profile. However, we could also assume 
the dark matter within the stellar effective radius "freezes out" 
with each major merger (new dark matter within Re only be- 
ing what is brought in by subsequent mergers, within the Re of 
those galaxies at the time of their first merger) - it makes rel- 
atively little difference, since the mass densiti es in the cores 
of ha los do not strongly evolve with redshift (iBuUock et al.l 
l200l . 

2.4.2. Mergers with Gas 

In a gas-rich merger, the gas loses angular momentum 
and, being dissipative, radiates its energy and collapses to 
the center of the galaxy. The collapse will, in general, con- 
tinue until the gas becomes self-gravitating, at which point 
it rapidly turns into stars as it further contracts (for details, 
see Hopkins et al. 2008d). The remnant will exhibit a "dissi- 
pational component" - the dense central stellar concentration 
which is the relic of these dissipational merger-induced star- 
b ursts. 

ICovington et al.l (l2008l) consider the dissipation of energy 
in mergers and use it to estimate the effe ct this will have o n 
the size of the merger remnant (see also ICiotti et al.ll2007h . 
which we test in Paper II and Paper III and show is a good 
approximation to our simulations and observed systems, and 
can be reduced to the approximation 



Re 



/? (.(dissipationless) 

l+(/gas//0) ' 



(6) 



where /o ~ 0.25-0.30, /^(.(dissipationless) is the effective 
stellar radius of the final merger remnant if it were entirely 
dissipationless, and /dissipational is the baryonic mass frac- 
tion from the starburst. For most major mergers, gas in 
the disks at the time of the merger will rapidly lose energy, 
and star formation is very efficient, so /dissipational reflects the 
gas fractions of the progenitor disks immediately before the 
merger /dissipationai(merger) = /gas = Mg^s (disks) /M, (remnant) 
(gas blown out by feedback from black holes and stellar 
winds, even with extreme feedback prescriptions, makes lit- 
tle difference here: it predominantly effects /gas of the pre- 
merger disks over much longer timescales, rather than the 
consumption in mergers). We show in Hopkins et al. (2008(|) 
that this is not strictly true in extremely gas-rich mergers 
(/gas > 0.5), where torques are inefficient at driving dissipa- 
tion, and ellipticals may not be formed from even major merg- 
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ers: these extreme cases however are seen only at the lowest 
observed masses, and do not affect most of our predictions 
(though we note where this may be important). 

Given the typical exponential-like (hs ^ I- 2) character of 
these dissipational components when they first form (owing 
to their formation from infalling gas), we can invert this to 
infer what the effective radius of the dissipational component 
specifically must be (given that effective radius and profile, 
we can then construct a mock profile of the entire galaxy). 
Based on our study of re-merger simulations in Paper III, we 
find that the two components (those which were originally 
dissipational at their formation, and those which were not - 
i.e. which originally came from stellar disks) tend to be sepa- 
rately conserved even after several dry or gas-poor spheroid- 
spheroid re-mergers. In a series of mergers, then, it is a 
good approximation to treat the two separately: dissipation- 
less components grow and add as described above. Dissipa- 
tional components form with a mass given by the gas content 
of merging disks with sizes appropriate for the above rela- 
tion, and then evolve either by gas poor (spheroid-spheroid) 
merging (where they will add with other stellar dissipational 
components formed in earlier mergers, following the rules for 
dissipationless merging above since both components are stel- 
lar), or by more gas-rich (spheroid-disk) merging (in which 
case new dissipational mass is added given the prescriptions 
above). 

Dissipation will also fuel growth of the central supermas- 
sive black hole. Based on our theoretical modeling of self- 
regul at ed black hole g rowth i n simulations (Di Matte oet alJ 
2005'; 'HoDkins et alJ l2006aL iHopkins & H ernauist 2006t 
Hopkins et al. 20071^) and that of others (e.g. Silk & Reel 



19981: iMurrav et al.ll2005HCiotti & Ostrikeil2007l) . essentially 



any model where the black hole regulates its growth by halting 
accretion (allowing some fraction of the energy or momen- 
tum of accretion to couple to the surrounding media) will give 
rise to a fundamental corre lation between black hole mass and 
spheroid binding energy ( Hopkins et alJl2()()7bh . more specif- 
ically with the binding energy of the gas in the central re- 
gions whose inflow must be suppressed to regulate black hole 
growth. 

T here is direct ev i dence f or this in the observations as well, 
with iHopkins et al.l (l2007cl) finding a ^ 3 cr preference in the 
data for a correlation of the formMsH ^ M"-^"*^-^ ct'^"^ ", sim- 
ilar to and consistent with the "fundamental" correlation be- 
ing one with bulge binding energy, which is roughly traced 
by Ep ~ M*(T^ (Mbh oc fi"'^', see also AUer & Richstone 
l2007h . Indirectly, this can then explain e.g. the obs erved 
Mbh-ct (iFerrarese & Merritlll2O()0l:lGebhardt e"tani200Q) and 
Mbh - M^ ( Magorria n et al.lll998F correlations. 

Motivated by our simulations and these observational re- 
sults, we model black hole growth as follows: in each merger, 
we allow the black hole to grow by an amount proportional 
to the binding energy of the new dissipational components of 
the merger (i.e. the starburst gas), in addition to summing the 
masses of the two progenitor black holes (we do not model 
any gravitational kicks that might expel black holes, but theo- 
retical estimates suggest that by z = in the massive galaxies 
we are intere sted in here, such effects are small if they are 
present at all; I Volonterietal] 120061; IVolonterill2007l) . We in- 
clude an intrinsic scatter ~ 0.2 dex, again motivated by sim- 
ulations (although there will be some scatter in the result- 
ing correlations regardless, reflecting different gas content 
and potential depth at the time of merger, with subsequent 
evolution to z = 0). The exact proportionality constant is a 
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Fig. 1 . — Predicted spheroid scaling laws at z = 0. Top: Effective radius 
as a function of stellar mass. Left: Our simulated systems are shown (equal 
numbers of systems per logarithmic interval in mass are simulated and plot- 
ted, to represent the full dynamic range), with color encoding the redshift of 
their last gas-rich merger (from black for z = to red for z > 3); solid lines 
show the median trends for systems with this redshift: z = 0— 1 (black), 1—2 
(blue), 2—3 (orange), and > 3 (red). Right: Solid (dashed) line shows the me- 
dian(±l(T) trend from the simulations. Observed systems from the samples 
of'Kormendy et alj ^2008), red stars) and iLauer et alj t2007b, purple circles) 
are shown for comparison (right). This notation is used throughout. Bottom: 
Projected central velocity dispersion as a function of stellar mass. 
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Fig. 2. — The fundamentalplane. Top: Dynamical mass as a function of 
stellar mass (style as in Fig. [T). Bottom: Effective radius as a function of a 
and p (here converted to stellar surface density S). Dotted red line shows the 
virial relation (no tilt in the fundamental plane). 

reflection of the physics of feedback efficiencies and cou- 
pling, but we adopt a constant calibrated in simulations to 
match the normalization of the observed z = Mbh - c re- 
lation (DiMatteoetal. 2005; Hopkins et al. 2005a, 2007b). 
Any other results (e.g. the slope of these correlations, residual 
correlations or correlations with other structural parameters, 
and/or redshift evolution in the correlations) are genuine pre- 
dictions of the model, not this calibration. 

3. PREDICTED SCALING LAWS AT Z = 

Figures [T] & |2] plot the fundamental plane correlations of 
ellipticals at z = 0, produced by this simple model. Here (and 
in subsequent figures) we draw systems randomly from the 
halo occupation distribution with the restriction that we draw 
equal numbers of simulated systems per logarithmic interval 
in halo mass (in order to represent the full dynamic range of 
interest in the plots). This differs slightly from selecting e.g. 
a volume-limited sample, which would be dominated by low- 
mass systems (less useful for studying the distribution in a 
parameter at a given mass, which is our intention here). 

We compare with a large sample of observed ellipticals : 
specifically the combination of the iKormendv et al.l (l2008h 
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comp ilation of V irgo ellipticals and the bright elliptical sam- 
ple of lLauer et al . (2007b) (which extends this dynamic range 
at the cost of slightly higher uncertainties in the data), with 
nuclear HST observations and ground-based data at large 
radii (allowing accurate surface brightness profile measure- 
ments from ^ 50 pc to ~ 50kpc)^. The correlations traced 
by these samples throughout this paper are statistically indis- 
tinguishable from the best-fit correlations (over the range ob- 
served) determined from volume-limited samples of (primar- 
ily fiel d ) early-type ga l axies in the SPSS, in e.g. Shen et dj 



(I2003h: iBernardi et al. 



Ivon der Linden et al] 



(I2OOI I2OO7I) : iGallazzi et al. (200j 
(l2007h . There may be a shght (not 
highly significant) offset between the data-sets, in the sense 
that the objects in the Vi rgo sample may hav e slightly larger 
Re and smaller a than thel Lauer et al.l (l2007b) sample at fixed 
mass; if real, this may relate to the former being a cluster sam- 
ple, or to the likely inclusion of a non-negligible SO popula- 
tion in the latter But in any case, the possible offset between 
the two is smaller than the uncertainties in our modeling (in 
terms of normalizing e.g. absolute sizes of systems, requiring 
accurate priors on their progenitor disk and halo sizes), and 
within the predicted and observed scatter, and so reasonably 
brackets the range of different observations. (For more details 
of the observational samples and their analysis, we refer to 
those papers and Paper 11.) 

The correlations are accurately reproduced at z = 0. Specif- 
ically, the size-mass relation has a steep logarithmic slope 
Re ~ M^l^, compared to Re ^ M'^-^ for the progenitor disks 
- the sense of this is specifically that low-mass ellipticals are 
more compact, relative to their progenitor disks. The velocity 
dispersion-mass (Faber-Jackson) relation is also reproduced, 
although this correlation is less dramatically different from 
that obeyed by disks (the baryonic Tully-Fisher relation). The 
resulting fundamental plane, expressed as either Mdyn(M,) or 
Re(cr, M,), is also reproduced; i.e. the predicted ratios of en- 
closed dark matter to stellar mass within the stellar Re agree 
with observations as a function of mass (note that with the 
adopted definition of Mdyn oc a^Re, Mdyn can be less than 
M,). Specifically, the tilt of the fundamental plane is repro- 
duced - rather than Mdyn oc M,, the expectation if systems 
were perfectly homologous, we predict Mdyn « M^"*"" (where 
a « 0.2 is the FP tilt), namely that lower-mass systems (as a 
consequence of their being more compact) are more baryon- 
dominated within their stellar effective radii. (The nature of 
the FP scalings are discussed in much greater detail in Paper 
IV; however, we outline the dominant physical effects in § 13.11 
&|12]below.) 

We plot the simulated systems, with colors denoting the 
redshift of their last gas-rich merger - i.e. approximately the 
time at which they should have stopped forming stars. It 
is clear that, at fixed stellar mass (or velocity dispersion, 
given that it more tightly correlates with stellar mass than 
e.g. effective radius), systems with larger radii (or larger 
Mdyn/M*) tend to be older - they are often systems that 
merged at early times and have grown dissipationlessly since 

' The profiles cover sufficient dynamic range that elifective radii can be 
determined directly from the integrated surface brightness profiles; adopting 
those from fitted profiles changes the results by a negligible amount. Velocity 
dispersions are compiled by the authors. Sersic indices are fitted in Paper 
II, and stellar masses are determined from the integrated photometry using 
the color-dependent mass-to-light ratios from Bell et al. (2003b). We have 
compared the results given different observed profiles for the same objects 
and different stellar mass estimators, and find that (in a statistical sense) they 
are unchanged. 
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Fig. 3. — Correlation between stellar mass and estimated best-fit galaxy 
Sersic index «, (style as in Figure[T). Top: Sersic indices obtained fitting the 
entire galaxy profile to a single Sersic law (over typical observed dynamic 
range). Bottom: Sersic indices of just the dissipationless (outer, violently 
relaxed) component (not including the dissipational central starburst). Ob- 
servational estimates are from the two-component modeling in Paper II and 
Paper III. Owing to the cosmological dependence of merger history on mass, 
there is an effective correlation between stellar mass and n,. This is most 
directly reflected in the outer, dissipationless component. Fitting the systems 
to a single Sersic index represents a more complex combination of merger 
history and the amount of dissipation (both effects can mimic one another). 

then. This appears to be borne out in recent observational es- 
timate s based on stellar population constraints in local ellip- 
ticals (IGallazzi et al.ll2006l; iGraves et alj|2008h . (We discuss 
this further below.) 

Figure [3] plots the Sersic index of the galaxy light profiles 
predicted at z = 0. We stress that these are rough estimates 
at best - individual galaxy profiles show a degree of diversity 
representative of many details in their formation and merger 
history that only high-resolution numerical simulations can 
capture. We therefore refer to Paper 11 and Paper 111 for a 
more detailed study and comparison of Sersic indices in sim- 
ulated and observed systems; however, comparison of our es- 
timates with numerical simulations suggests that the adopted 
prescriptions capture at least the median behavior in simula- 
tions. 

First, we consider the Sersic index fitted to the entire galaxy 
profile - this is of course a useful observational parameter, 
but it reflects a mix of different physical components of the 
galaxy. In order to transform a galaxy from an exponential 
disk profile (a Sersic index «, = 1) to a typical massive ellipti- 
cal with a high Sersic index > 4, both dissipation and violent 
relaxation are necessary. Violent relaxation will scatter stars 
at large radii, building up an extended envelope (characteristic 
of high-Hj systems), but high-n, profiles rise steeply at small 
radii, so dissipation is needed to build up a dense central stel- 
lar concentration. The effects of repeated mergers on the pro- 
file at large radii are relatively straightforward, and (more or 
less) monotonically increase n^, but the consequences of dis- 
sipation are more complex (related not just to the dissipational 
mass fraction but to the relative concentration of the dissipa- 
tive component and its profile shape) and non-monotonic. 

We construct the best-fit Sersic index fitting a dynamic 
range comparable to the observations plotted (representative 
of local galaxies for which the highest-quality data is avail- 
able), from ^ 50pc to ^ 30kpc. Because the galaxy profiles 
are multi-component, the resulting Wj reflects a complex com- 
bination of the observed dynamic range, the relative mass, 
size, and slope of the dissipational component, and the merger 
history. Together, these effects yield an apparent steep depen- 
dence of Sersic index on mass, in agreement with the obser- 
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Fig. 4. — Correlation between black hole mass and host spheroid prop- 
erties (effective radius, velocity dispersion, stellar mass, and the observable 
proxy for bulge binding energy, M, cr^, effectively the same as the "black 
hole fundamental plane" observed). Style as in Figure[T] Magenta stars show 
observed systems with direct black hole mass measurements, compiled in 
iHoDkins et al. l2007c) (see Magorrian et al. 1998; Menitt & Ferrarese 200]]; 
riVemaine et al.. l2002l : iMarconi & HunHl200aiHaring & Rixl l2004). A sim- 
ple prescription for black how growth based on dissipational binding energy 
reproduces the z = correlations. 

vations, but with large scatter (whereas if the profiles were 
entirely dissipationless, the lack of dense stellar concentra- 
tions at small radii would restrict them to smaller Sersic in- 
dices and lead to a weaker dependence of n, on merger his- 
tory and mass). Most of the strength of the dependence here is 
ultimately a coincidence reflecting particular combinations of 
parameters moving into or out of the dynamic range for which 
the fits are sensitive. 

If we consider the Sersic indices of just the dissipationless 
component (i.e. the sum of the stellar components of pre- 
merger disks that were violently relaxed in the initial gas- 
rich mergers that formed each spheroid progenitor), the de- 
pendence on mass is weaker, but it more directly reflects the 
merger history of the systems (and depends much less on e.g. 
the dynamic range of the fit). We compare this with multi- 
component decompositions of the same observed systems, 
based on a more detailed, physically motivated approach to 
studying galaxy profiles which we discuss in detail in Paper 
1-Paper 111, where we demonstrate that the "dissipationless" 
component of observed ellipticals can, in fact, be recovered. 
By definition in our simple model, the dependence we predict 
now is entirely driven by the mean dependence of merger his- 
tory on mass - systems which have experienced more merg- 
ers (and more violent mergers) will have had more stars scat- 
tered to large radii in those mergers, raising the n^ of the outer 
component. At low masses, most systems formed in ~ 1 ma- 
jor merger at relatively low redshifts z < 1-2, so a typical 
n, ~ 2 - 3 is obtained. At high masses, systems often have 
their first merger at very high redshifts, involving a large de- 
gree of dissipation, forming a dense core, and then experience 
a number of dry mergers, building up a more extended dissi- 
pationless outer component and envelope and raising n,. 

The apparent trends of Sersic index with mass are much 
stronger if dwarf sph eroidals are included, as in many obser- 
vational samples (e.g. lGraham & Drived200'7tlFerrarese et alj 
12006.) . These systems have most likely not experienced any 
significant mergers - as a structural family, they are clearly re- 
lated to disks, not ellipticals, and their Sersic indices (typical 
Hs ~ 1) demonstrate this; that they dominate at low masses re- 
flects e.g. the dominance of disks and ellipticals, and is again 
driven by the cosmological dependence of merger history on 
mass. 
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Fig . 5 . — Correlation between black hole mass and estimated best-fit galaxy 
Sersic index n, (style as in Figure[T). Magenta stars show observed systems 
as in Figure O Owing to the coirelation between Mbh and M, , there is a 
correlation between Mbh and fij, but the scatter is significant at large Mbh 
(where there is an extended tail towards very high Wj, sensitive to the dynamic 
range fitted). 

Figure m compares the predicted and observed correlations 
between nuclear black hole mass and host bulge/spheroid 
properties (effective radius, velocity dispersion, stellar mass, 
and binding energy). Recall, motivated by observational and 
theoretical expectation s of a black hole "fundamental plane" 
(iHopkins et al.ll2007drbl) the fundamental "assumed" correla- 
tion here is between black hole mass and binding energy of 
the dissipational component of the bulge, determined at the 
time of that dissipation (time of gas-rich merger). Despite the 
complex interplay between this and the total bulge mass and 
size, dry mergers, and evolution in halo bulge properties, the 
z = correlations (includ ing the residual correlations observed 
in lHopkins et al.ll2007ch ) are accurately reproduced. The cur- 
vature seen in the Mbh - Re and Mbh - cr relations reflects a 
combination of the curvature seen in the correlations between 
these properties and stellar mass, and the increasing impor- 
tance of dry mergers at higher masses. 

We also consider the correlation between black hole mass 
and Sersic index in Figure |5] comparing to the data com- 
piled in iHopkins et al. (2007c) (an update of the fits in 
iGraham etal.1 120011: iGraham & DriveiJl2007l) . Owing to the 
dependence of «j on stellar mass, there is an indirect corre- 
lation between Mbh and n^, consistent with that claimed in 
IGraham & Drived(l2007l) . Those authors note that the correla- 
tion is not well-fitted by a single power-law; here we see such 
curvature. The reason is that most objects at M* < 10"^ Mq 
have similar merger histories, there is little dependence of n^ 
on M* or Mbh in this mass range, then a steep dependence, 
then again a relatively flat dependence at high Mbh- The same 
caveats regarding these Sersic indices apply as in Figure[3] In 
this model, the correlation between Sersic index and black 
hole mass may appear reasonably tight over a narrow range, 
but this is accidental, and there is no additional information 
on the black hole mass derived from the single Sersic index 
fits. 

3.1. The Effects of Dissipation 

3.1.1. What Drives the Amount of Dissipation ? 

Dissipation is one of the most important factors control- 
ling the properties of remnant ellipticals. We demonstrate this 
robustly with detailed observations and simulations in Paper 
II and Paper III. Here, we discuss how its effects are mani- 
fest in a cosmologically representative population, including 
systems which have experienced a number of dry mergers. 
Figure |6] shows the integrated fraction of our z = ellipti- 
cals which formed in dissipative starbursts (from gas in disks 
at the time of each merger), as opposed to dissipationlessly 
scattered stellar disks. Recall, dissipation allows gas to col- 
lapse to small scales, building up a dense central component 
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Fig. 6. — Mass fraction formed dissipationally (integrated mass fraction of 
the z = elliptical formed from gas in progenitor disks at the time of gas-iich 
mergers, in dissipational starbursts) as a function of stellar mass (style as in 
Figure[T). We compare (top right) with empirical estimates of this quantity 
in observed ellipticals from the study of a large sample of observed ellipti- 
cal surface brightness profiles and dissipation in Paper II and Paper III. We 
also compare (bottom) with observed disk galaxy gas fractions as a function 
of stellar mass, at z = iKannappan 2004; McGaugh 2005, blue diamonds, 
squares respectively), z = 1 (Shapley et al. 2005, green squares), and z = 2 
{Erb et al. 2006, red circles). The dissipational fractions at z = agree well 
with those observed, and reflect the gas fractions of their progenitor disks 
over the redshift range z ~ 0-2 where most systems had their last merger(s). 



in the stellar mass distribution and changing the phase-space 
distribution of the remnant. We compare with the observa- 
tionally estimated dissipational fractions in local ellipticals, 
determined in Paper II and Paper III. The agreement is good 
at each mass - in other words, the amount of dissipation ob- 
served in ellipticals can be explained by our simple model. 

What drives the trend of dissipation with mass? We find 
that it reflects the typical gas fractions of disks as a function 
of mass. That trend (in disks) may, of course, be a complex 
consequence of poorly understood processes such as star for- 
mation, cold gas accretion, and stellar feedback; but for our 
purposes here we are not interested in how this property arises 
in disks (simply what the observed properties of disks are, 
which are captured by construction in our halo occupation ap- 
proach). 

Figure |6] compares these integrated dissipational fractions 
with observationally estimated gas fractions of disks (as a 
funct i on of stel lar mass) at z = (see e.g. i Bell & ^ e Jong 
2000[lKannap"pan 2004; McGaugh 2005), z = 1 (Shap lev et al. 
2005h . and z = 2 (Erb et al. 2006). There is a systematic evo- 
lution in gas fractions with redshift, as expected, but the gas 
content of observed disks from z ^ - 2 appears to bracket 
the range in both our predicted dissipational fractions and the 
observed dissipational fractions of ellipticals. 

At the lowest masses, where the predicted dissipational 
fractions and observed disk gas fractions approach unity, the 
observationally inferred dissipational fractions are somewhat 
lower, appearing to reach a maximum near ^ 0.4. This most 
likely relates to the fact that extremely gas-rich mergers will 
not efficiently dissipate angular momentum and will leave 
dis k-dominated, rather th an elliptical remnants (for details, 
see iHopkins et alj |2008d). These subtleties, not included in 



our model here, are important for the evolution of bulge-to- 
disk ratios in low-mass galaxies, but do not significantly af- 
fect our predictions over most of the mass range of interest 
(M, > IO'^Mq, where f^,, < 0.5). 

At fixed mass, systems with earlier formation times tend 
to have slightly lower dissipational fractions; we emphasize 
that this is true when the systems are observed at z = 0. At 




10.5 11.0 11.5 
log(M.p,/Ms) 

Fig. 7. — Top: Fraction of ellipticals at a given stellar mass that have had 
N major mergers from z = — 6. Bottom: Median (soHd) and itlcr (dashed) 
range of redshifts of the last major merger for ellipticals of a given stellar 
mass (not the last gas-rich merger; where the number of mergers A' > 2, this 
last merger is often dry). Most of the mass in ellipticals (at least at L < a few 
L, ) is assembled by a relatively small number of mergers at relatively low 
redshifts z ~ 0-2. As a result, dissipational fractions roughly reflect those of 
disks in this redshift interval. 

their time of formation, early-forming systems may have had 
very large dissipational fractions, reflecting the characteris- 
tically higher gas fractions of disk progenitors at these red- 
shifts (we show this in greater detail in § |5] below). How- 
ever, these systems will also be in the most biased environ- 
ments, and so (in a ACDM context) undergo relatively more 
growth at later times, assembling much of their z = mass 
via mergers with less gas content (systems that have largely 
gas-exhausted and/or quenched) and correspondingly lower 
dissipational fractions. 

This simple correspondence is expected, because most of 
the mass in ellipticals is assembled in a relatively small num- 
ber of major mergers, at relatively low redshifts correspond- 
ing to this observed range (z ~ 0-2). Figure [T] shows the typ- 
ical number of mergers of ellipticals as a function of stellar 
mass, as well as the typical redshifts of the most recent ma- 
jor merger. The pred icted trend is similar to our estimate in 
iHopkin s et al.' ('2008c) as well as other cosmological models 
(see e.g. de Lucia & Blaizot 2007); most ellipticals at < L* 
have had just one major merger, with even massive ellipticals 
at '--^a few L* still having only a small number of such merg- 
ers. 

In IHopkins et al.l (l2008cl) we show that the fraction of sys- 
tems with e.g. only one major merger in their history versus 
multiple major mergers, or for which the last major merger 
was gas-rich, agrees well with the observed trend in the 
fraction of ellipticals with central cusps (power-law nuclear 
slopes), rapid rotation, or disky isophotal shapes as opposed 
to central cores, slow rotation, or boxy isophotal shapes (in- 
dicators of w et versus dry mergers, d iscussed in de tail in e.g. 
Paper II and iFaber et al.iri997l; iNaab et al...2006ai; ICox et alj 
|2006|) . The most recent major mergers, where the final stellar 
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mass (at least ^ 1 /2 of it) is assembled, occurs at relatively 
low redshifts, with a median z ^ 1 and spanning the range 
z ~ 0-2. Of course, the most massive BCGs form a tail in 
this distribution with much more complex histories, and for 
massive galaxies undergoing > 1 -2 major mergers, the most 
recent mergers are dry, but in any case the broad point re- 
mains: most of the z = mass in ellipticals is assembled in a 
relatively small number of major mergers at z < 2, and there- 
fore their dissipational fractions reflect those of disks over the 
same interval. 

This general conclusion (that major mergers with mass ra- 
tios <1:3, as opposed to more numerous minor mergers, dom- 
inate the mass assembly in mergers once gas is exhausted and 
star formation "quenched") is supported by calculations from 
other halo occupation models (Zheng et al. 2007) and cosmo- 
logical simulations (Mailer et al. 2006) . However, these cal- 
culations, as well as clustering estimates (Masiedi et al. 2007) 
and hydrodynamic simulations dNaab et alj20 07) suggest that 
as galaxies approach the most massive M* > lO'^M© BCG 
mass regimes, growth by a large series of minor mergers be- 
comes more important than growth by major mergers. 

Roughly speaking, these calculations suggest that, since 
most of the mass is in ~ L* systems, mergers with such 
systems dominate mass growth: so when galaxies are at 
masses < a few L*, their growth is dominated by major 
mergers, and at higher masses, their growth is dominated 
by progressively more minor mergers. Our predictions in 
this limit should therefore be treated with some degree of 
caution, in contrast with the bulk of the spheroid popula- 
tion at < L* (typically observed in field or low-density en- 
vironments, where the expectation for growth by a small 
number of major mergers since z ^ 2 -4 is reasonable; se e 
iBlanton et al! l2005t IWaiig et alJ 120061: iMasjedi et aP l2006l) . 
On the other hand, it is still unclear whether such minor merg- 
ers actually contribute significantly to mass growth even in 
the high-mass regime, or whether satellites are disrupted in 
this regime or kicked into orbits wh ere the merger time is 



longer than the Hubble time (see e.g. i Hashimoto et al] 
Tormenetal. 2004; Benson et al. 2004 HZheng et alj 
Kazantzidis et alji2007i:iBrown et al.ii20(m 
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We, in any case, have experimented with extending our 
modeling to include minor mergers, and find that this yields 
qualitatively similar conclusions (with moderate quantitative 
but no qualitative differences in the predicted properties of 
the most massive galaxies, and no significant differences oth- 
erwise). However, our predictions (and those of other halo- 
occupation based approaches) are less certain for minor merg- 
ers owing to ambiguities in e.g. satellite disruption and weaker 
constraints on the satellite mass function at small mass ratios. 

3.1.2. What Effects Does this Dissipation Have ? 

How does this amount of dissipation change the properties 
of the z = ellipticals? We consider this question in detail 
in idealized simulations and observed ellipticals in Paper IV: 
in short, increasing the amount of dissipation yields remnants 
with more of their stellar mass in a compact central starburst 
remnant. This yields a stellar remnant with a smaller effec- 
tive radius R^, and therefore overall a more baryon-dominated 
object (lower Mdyn/M*). We demonstrate that these general 
conclusions remain both qualitatively and quantitatively valid 
in a more general scenario here. In order to compare the 
effects with and without dissipation, we re-run our model, 
but without including the effects of dissipation: this effec- 
tively amounts to treating gas identically to stars in mergers. 




Fig. 8. — Same predicted spheroid scaling laws at z = as Figure[T] but 
in a model with no dissipation (gas is treated the same as stars in mergers). 
Without dissipation, the correlations are similar to those observed for disks, 
with a too-shallow size-mass relation (Sj oc MJ ''). Low-mass systems require 
dissipation to be as compact as observed - other effects (redshift evolution in 
disk sizes and dry mergers) are insufficient to reproduce this effect, and (even 
tuned) cannot simultaneously reproduce the slope and normalization of the 
size-mass relation. 
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Fig. 9. — The fundamental plane, as Figure[2] with no dissipation. Without 
dissipation, there is no tilt (the low-mass ellipticals here have too-large an 
Mdyn/^*> related to their too-large sizes in Figure[8). Fitting Mjyn oc M\^'^ , 
we obtain a ~ -0.1 to 0, similar to disks, as opposed to a = 0.2 observed. In 
this projection, dry mergers and redshift evolution in disk/halo sizes cannot 
give significant tilt. The pile-up in the lower-left panel is related to an effec- 
tive upper limit in the phase space density of disks/halos without dissipation. 



or equivalently treating initial disks as if they are all gas-free. 
Figures |8] & |9] plot the fundamental plane correlations of 
ellipticals at z = 0, yielded from this alternative model. Low- 
mass ellipticals, without dissipation, now have sizes similar to 
those of their progenitor disks, and much larger than the sizes 
observed. The slope of the size-mass relation, R^ (x M°^, re- 
flects that in progenitor disks and halos, and disagrees dra- 
matically with the observed slope for ellipticals Rg ex M"^. 
As we show in Paper 11, the trend of increasing dissipation 
at low masses, owing to increasing gas fractions in lower- 
mass disks, drives smaller mass ellipticals to relatively lower 
masses, steepening the size-mass relation and giving rise to 
the trend in Figure [T] Other trends, such as e.g. the rela- 
tion of dry mergers to mass, are insufficient to explain the 
observed size-mass relation (this is expected: dissipationless 
and dry mergers can only lower the density of objects; they 
can increase radii at large mass but cannot make low-mass 
ellipticals smaller or more dense as needed). Invoking evolu- 
tion in disk sizes and high-redshift mergers can make ellipti- 
cals more compact overall, but affects the size-mass slope in 
the opposite sense required (since the highest-mass ellipticals 
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Fig. 10. — Correlation between stellar mass and estimated best-fit galaxy 
Sersic index its, as Figure |3] with no dissipation. Without dissipation, Ser- 
sic indices remain too low at all masses (central densities cannot rise to the 
values observed) - the Sersic indices fitted to the whole galaxy are the same 
as those of just the dissipationless outer component (bottom); i.e. they can 
match the outer regions of elliptical profiles but fail to match the cenfi'al sur- 
face brightness of ellipticals (not dense enough at the center). 



will have their first mergers at the earliest times, they would 
be the most compact relative to disks today; the opposite is 
observed). 

This difference in our predicted size-mass relation is also 
reflected in the velocity dispersion-mass and fundamental 
plane relations, albeit more weakly, since the effective ra- 
dius of the stars enters only indirectly: much of a is set by 
the potential of the halo, which is not altered much by dis- 
sipation. Nevertheless, the Faber-Jackson relation disagrees 
with that observed (again, in the sense that ellipticals are not 
dense enough), and the fundamental plane tilt is erased (we 
obtain M^yn oc M°'^~'°, essentially identical to that observed 
in disks). 

These discrepancies are also apparent from the predictions 
in Almeida et al. (2007), wh o us e the semi-analytic models 
from both Bau gh et all (l2005h and lBower et al.1 ( l2006h to pre- 
dict galaxy properties in a similar manner as we have done, 
but treating gas identically to stars (in fact, their prescriptions 
for galaxy sizes and dark matter scalings are almost identi- 
cal to ours in this "no dissipation" case). As a direct result, 
at z = 0, their predicted size-mass relation is much too flat 
(Re oc M° ■') relative to that observed for spheroids, and the 
predicted fundamental plane has no tilt (in fact it is slightly 
tilted in the incorrect sense Mdyn oc M°**""'^). They consider 
the differences between the two semi-analytic models, as well 
as the inclusion or removal of adiabatic contraction in the 
galaxy halos, supernova and AGN feedback, self gravity in 
the baryons, and systematically higher or lower-energy orbits, 
and find that none of these significantly change the predicted 
fundamental plane scalings. In short, at z = 0, dissipation 
(making galaxy stellar mass distributions more compact) is 
much more important and more significant to the fundamen- 
tal plane correlations than any differences between various 
models for galaxy formation and evolution - only dissipation 
works as a viable explanation of the observed scalings. 

The Sersic profiles in the absence of dissipation will be 
those of the outer/dissipationless components, already shown 
in Figure [3] (in this case, both the Sersic index of the entire 
profile and the Sersic index of just the dissipationless com- 
ponent will be the same, and be identical to that of just the 
dissipationless component in our full model). We directly 
compare this with the observed Sersic indices in Figure [TOl 
Although this matches the outer profiles of observed systems. 
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Fig. 1 1 . — Same predicted spheroid scaling laws at z = as Figure[T] but in 
a model with no dry mergers (structural change in spheroid-spheroid mergers 
is suppressed). Dry mergers do not affect the predictions at low mass, but at 
high mass (A/, 2> 1O"M0), a lack of dry mergers means that ellipticals are 
not as large as observed. The sense of residual correlations at high mass is 
now reversed - the oldes t systems at fixed mass or a are now the smallest, in 
contrast to observations jGallazzi et aU200^ ; IGraves et aU2008l) 

it clearly is not the same as the Sersic indices obtained fitting 
the observed galaxies to a single Sersic law over the entire 
dynamic range - the high Sersic values (for the entire profile) 
Us ~ 5- 10 observed can only be reproduced with some dissi- 
pation to make a dense central concentration of light (match- 
ing the steep rise at small radii of large-n, Sersic profiles). 
This ultimately points to the same issue as the size-mass rela- 
tions: dissipation is needed to raise the central stellar densities 
in order to match observed correlations. 

Without dissipation, there is of course no way to grow the 
central black hole; there is no correlation between black hole 
mass and host galaxy properties for us to predict. We could 
in principle assume that black hole mass is dissipationlessly 
conserved in mergers, but initially set by disk masses - how- 
ever, this would predict that it is disk, not bulge properties 
that set the black hole mass, in direct contradiction with ob- 
servations. We could assume that Mbh strictly traces the stel- 
lar mass or velocity dispersion in the bulge (or in the dissi- 
pationless component of the bulge), but as demonstrated in 
iHopkins et al.l (l2007d) . this fails to explain residual correla- 
tions between black hole mass and bulge binding energy at 
fixed M* or a. 

3.2. The Effects of Dry Mergers 

Having analyzed the effects of dissipation, we now turn to 
the effects of dissipationless or dry mergers. Since essen- 
tially all disk-dominated galaxies do, in fact, have significant 
gas fractions, this in practice refers to subsequent spheroid- 
spheroid re-mergers. What effect do such mergers have on 
the z = scaling relations of ellipticals? In order to study this, 
we re-run our model, including dissipation but excluding such 
dry mergers (we have also considered allowing them to occur 
in the sense that they add stellar mass, but not allowing them 
to alter the structural properties of the galaxy; the results are 
similar). 

Figures[TT]&[T2]plot the fundamental plane correlations of 
ellipticals at z = from this alternative model. At intermedi- 
ate and low masses, the number of mergers in our full model 
is small (see Figure |7]), and dissipation is dominant, so the 
correlations predicted here are essentially identical to those in 
Figures [T] & |2] At high masses or luminosities (>a few L*), 
however, there is an appreciable effect on the projections of 
the FP. The highest mass systems have their first mergers at 
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Fig. 12. — Fundamental plane, as Figure |2] with no dry mergers. The 
difference is minimal (although there is slightly less tilt), since dry mergers 
tend to move systems parallel to the FP. 

early times, when their progenitors are more gas-rich, and are 
therefore compact when first formed. Without dry mergers to 
puff them up at later times, they do not have effective radii 
as large as observed. The difference in the scaling laws over 
the dynamic range observed in Figure [TT] is subtle, becoming 
dramatic only at the highest masses. 

There is, however, a significant effect in the residual cor- 
relations with e.g. size and merger history at fixed stellar 
mass or velocity dispersion. Without dry mergers, then 
(because of the same effect), the oldest systems at fixed 
mass would be the smallest. This is the opposite of ob- 
served trends (Galla zzi et alj|2006t [Graves et alj |2008'): older 
BCGs are extremely clustered, and so (unless CDM mod- 
els are seriously incorrect) had their first mergers at early 
times, where spheroids are observed to be compact (see §|4|i. 
The observations that e.g. older BCGs tend to be the most 
extended systems dvon der Linden et alj|2007l; iBernardi et alj 
12007) therefore demands some dry mergers in th eir history 
iHausman & Ostrikeriri978l:lHernquist et al.lfT993l) . However, 
as shown in Figure |7] the number of such dry mergers (es- 
pecially at z < 1 - 2) is still relatively small, so this is con- 
sistent with the ~ 1 major dry merger since z ^ 1 which di- 
rect observations of dry merging pairs (van Dokkum 2005; 
iBell et ai] 120061 : iLin et al. 20 0J|) and constraints from mass 
function evolution (Bundvetal] 12005: 



Perez-Gonzalez et 



2007al:IZheng et al.ll2007i) suggest 
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iBorch et alJ 120061: 
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The fundamental plane itself is less affected by dry merg- 
ers. This is not surprising: numerical simulations demon- 
strate that both major and minor dry mergers tend to move 
systems parallel to the fundamental plane (see Pa per IV and 
iBovlan-Ko lchin et alj200HRobertson et alj2006l) . Where an 
individual galaxy ends up on the FP is therefore affected by its 
dry merger history, but the FP itself is not very much altered. 
There is a very weak effect because of the evolution in the FP 
discussed in §2] but it is much less apparent here than in e.g. 
the size-mass relation. 

Figure [13] shows the Sersic indices as a function of mass in 
the absence of dry mergers. Without repeated dry mergers to 
scatter stars at large radii and build up a more extended en- 
velope, the profile of the outer/dissipationless component is 
self-similar at all masses (failing to reproduce the weak, but 
significant dependence observed). Fitting the entire profile to 
a single Sersic index, the effects of dissipation still contribute 
to a dependence of this index on mass (again emphasizing that 
with such a single-component fit, the Sersic index does not 
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Fig. 13. — Correlation between stellar mass and estimated best-fit galaxy 
Sersic index n,, as Figure[3] with no dry mergers. Without dry mergers, Sersic 
indices remain flat at the values characteristic of gas-rich merger remnants. 
The large Sersic indices at high masses (reflecting the extended envelopes of 
the most massive ellipticals) are not reproduced. 
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Fig. 14. — Congelation between black hole mass and host spheroid proper- 
ties, as Figure|4] with no dry mergers. There is fairly little effect here - dry 
mergers conserve both spheroid binding energy and total black hole mass, so 
move systems parallel on the black hole fundamental plane correlations. The 
(weak) change along projected correlations (e.g . Mbh ~ "") reflects the small 
structural changes of the remnants in Figure fTll 

robustly track merger history as it does in a two-component 
decomposition). Both the effects of dry mergers and dissipa- 
tion (in particular combinations) are needed to reproduce the 
observed trends. 

Figure[T4]shows the correlations between BH mass and host 
properties in this model. The effects here are quite weak. 
Since stellar mass and black hole mass are conserved in dry 
mergers (and the projected correlation is roughly Mbh « M*) 
there is almost no change in that correlation with or with- 
out dry mergers. Total spheroid binding energy is also con- 
served (which leads to velocity dispersion being nearly con- 
stant) in dry mergers, but the correlation here is not linear 
(although it is the more fundamental correlation in our mod- 
els). If we assume the "initial" correlation is the power-law 
Mbh oc fi^'J (or Mbh oc a'^ for the Mbh - o" relation), then (as- 
suming M* ex a"^ , which gives Ebui « mJ ) after a merger 
of mass ratio / (/ < 1) two systems initially on the correla- 
tion will move a factor (1 +/)/(l +/-'/^)"^' off the correlation 
(likewise for the Mbh - cr relation. This amounts to a ~ 20% 
(~ 0. 1 dex) effect in major mergers - so we expect (given the 
~ 0.3 dex intrinsic scatter in the correlations) it will only be 
noticeable at the most massive end where systems have under- 
gone a large number of dry mergers. We do see this, but over 
the dynamic range of the present data, there is little distinc- 
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Fig. 15. — Predicted evolution in spheroid sizes with redshift. We show 
the predicted median size of spheroids at a given stellar mass, relative to the 
median size of objects of that mass at z = 0. Shaded range is the uncertainty 
owing to both cosmology and how rapidly progenitor (disk and halo) prop- 
erties evolve. Solid lines are simple power-law fits (Equation |7). Dashed 
line shows the power-law fit corrected (approximately) to the Re that would 
be measured in optical/UV bands (as opposed to the half-mass radius: stellar 
population effects tend to make very young ellipticals with recent central star- 
bursts more concentrated in short-wavelength light). D otted li ne is the predic- 
tion from the semi-analytic model of Khochfar & Silk (2006). Observational 
estimates for g alaxies in the appropriate stellar mass intervals are shown from 
IMcIntosh eT^ . (2005, circles), Truiill o et al.l a2006i . stars), iZirm et alj l2007t 
square), and[yan Dokkum et al. (2008, diamond). Progenitors are more gas- 
rich at high-z, yielding mergers with more dissipation and therefore smaller 
remnants. At high masses, low-z disks are gas poor (and dry mergers have 
been more important), so the relative difference is larger; at low masses, there 
is little room for gas fractions to increase, hence weak evolution. 

tion that can be clearly drawn - the scatter in various proper- 
ties (and the fact that e.g. neither Mbh - o" nor Mbh -M, is the 
most fundamental coiTelation in this model) mostly washes it 
out. In terms of the correlations with effective radius R^, there 
are more noticeable changes, directly reflecting the different 
size-mass relation in Figure [TT] 

4. EVOLUTION OF SCALING LAWS AS A FUNCTION OF REDSHIFT 

We have shown how dissipation and dry mergers alter the 
coiTelations of ellipticals at z = 0. Of course, gas fractions of 
their progenitors evolve with redshift, and dry mergers pref- 
erentially occur at low redshift. In addition, other properties 
of the progenitors, such as disk and halo sizes, will evolve. 
We therefore expect there could be evolution in these scaling 
laws, and make predictions for this here. We adopt our stan- 
dard model, but construct samples as they would be observed 
at a given non-zero redshift. 

4.1. Predicted Evolution In Our Standard Model 

4.1.1. Spheroid Sizes 

Figure[T5]shows how the median effective radii of spheroids 
at fixed stellar mass are predicted to evolve with redshift. We 
vary the assumptions described in §|2] most importantly (other 
uncertainties being relatively negligible at the redshifts of in- 
terest here) whether (and how strongly, within the observa- 
tional limits) disk sizes gas fractions evolve with redshift, and 
show the resulting aUowed range. We compare with obser- 
vations _offlie_s2heroid size-mass relatio n from Truiillo et al. 
J2006"). Mcintos h et al.l (12005), van Dokkum et al. (20 08|) and 
Zirm et al.. (.2007 1) (see also .Franx et al-2008:.Damjanov et al.l 



20081) . Although the observational constraints are weak, they 
agree with our predictions at redshifts z ~ 0- 3. 

Evolution in galaxy gas fraction, which directly translates 
to evolution in the amount of dissipation in mergers, drives 
this behavior This is expected - in § 13.11 and in greater de- 
tail in Paper II, we show that the degree of dissipation is the 



dominant factor determining the stellar effective radius. Disk 
gas fractions are larger at high redshift, so in mergers, a larger 
mass fraction will be formed in a compact dissipational star- 
burst, yielding a smaller stellar remnant. Dry mergers have 
also had less time to act and puff up remnants at high redshift. 
If disks are more compact at high redshift, this drives further 
evolution; however, observational constraints suggest that this 
is relatively weak (see § 12.3b . and so it is less important than 
the evolution in gas fractions. 

This also predicts that the relative size evolution should be 
stronger in high-mass systems. Fitting these predictions to 
simple scaling laws of the form 

{Re{M, \z))=(l +z)-^ {Re(M, I z = 0)) (7) 

we obtain /3 = (0.0-0.2,0.24,0.48,0.64) (±0.05 or so 
for each) for M, - {\(f , 10'", 10" lOi-)M0 (roughly /3 w 
MAX[0.23 log(M,/10''MQ), 0.2]). Low mass disks are still 
gas-rich at z = (and have generally not experienced many dry 
mergers), so there is limited room for evolution (the evolution 
we do predict is almost entirely driven by whatever evolution 
in progenitor disk sizes is assumed, hence the quoted range 
0.0-0.2 in the lowest-mass bin). At high mass, however, dry 
mergers have been increasing spheroid sizes for a significant 
redshift interval. Furthermore, disks at high mass at low red- 
shift are relatively gas-po or; by z ^3, the y have doubled or 
tripled their gas fractions (lErb et al.ll2006h . The relative size 
difference is therefore much more pronounced. 

These predic ti ons are similar to those made by 
iKhochfar & Silkl (l2006h . who adopted a full semi-analytic 
model to estimate the history of disk galaxies, accre- 
tion, and star formation, and then modeled disk sizes 
by assuming effective radius is proportional to the mass 
fraction in the dissipationless (non-starburst) component; 

Re(Mf,,Z) = Re(M^,,Q) X (/dissipationless [z] //dissipationless [0]). 

For comparison, we show their predictions in Figure [15] At 
intermediate and low masses, they are similar to ours. This is 
because their simple model captures the key qualitative aspect 
which drives our predicted evolution, namely the evolution 
in disk gas fractions with redshift, making high-redshift 
ellipticals more dissipational than their low-redshift counter- 
parts. Ultimately, their determination of the size evolution 
with dissipational fraction is not a bad approximation over 
intermediate gas fractions (to the accuracy here). However, 
there is a signifi cant difference in the p redictions at the 
highest masses - IKhochfar & Silkl (1200 6*) predicted much 
stronger evolution than we do. This is the regime where 
our attempts to improve on their estimates are of particular 
importance. 

First, their predictions are tied to some assumptions and 
modeling of disk galaxies in their semi-analytic model; this 
is precisely why we have tried to be as empirical as possi- 
ble in constructing our progenitor galaxies. They note that 
their strong evolution in this mass bin is the result of a high- 
redshift overcooling problem in describing massive galaxies 
in their semi-analytic model (a well-known difficulty without 
strong feedback). As a consequence, their disk gas fraction 
evolution to z ^ 2 is much steeper than observed - i.e. "pro- 
genitor" disks at these masses and redshifts are demonstrably 
too gas-rich and in halos of too low a mass. lErb et al] (l2006l) 
see typical gas fractions for M* > 10" M© galaxies at z ~ 2 of 

- 20% (whereas Khochfar & Silk (2006) predict - 70-80%) 

- even if the ellipticals observed at z = 2 formed from higher- 
redshift mergers with gas fractions say, double this value, this 
predicts factor < 2 size evolution at z = 2, similar to our es- 
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timates. By using a halo occupation approach, we avoid this 
problem. 

Second, our size estimates are calibrated directly to numer- 
ical simulations: we track the evolution of multiple galaxy 
components in merger simulations and use this to design our 
prescriptions. Of particular importance, we include the effects 
of dry mergers appropriately and extrapolate our prescriptions 
to arbitrary dissipationa l fractions: a t high /dissipationai, the sim- 
ple size estimator from lKhochfar & Silk (2006) clearly must 
break down, as galaxies will still have finite sizes. This direct 
calibration with simulations avoids ambiguity in e.g. degen- 
eracies between cosmological effects and assumptions about 
how spheroid sizes scale, and allows for more robust predic- 
tions. 

Nevertheless the reason for our predicted evolution, and 
its sense, is essential ly the same as that identified in 
iKhochfar & SiUd (l2006h . Ellipticals at high redshift have, on 
average, formed a larger fraction of their mass in dissipational 
starbursts (owing to their progenitors being, on average, more 
gas-rich) at these redshifts. Dissipation allows this component 
to be significantly more compact than the dissipationlessly vi- 
olently relaxed stars from the progenitor disks, giving rise to 
a more compact remnant. 

At the highest masses (^ 10'' M0), the evolution we pre- 
dic t is som ewhat weaker than that observationally inferred 
bylZirmet al. (2007); van Dokkum et al. (2008); Franx et al] 
(|200 8!). However, as demonstrated in Paper 1, this can be 
explained by an additional bias - if systems at high z are 
very young (ages < 0.5 - 1 Gyr) remnants of gas-rich merg- 
ers (as we predict precisely these systems are, and as obser- 
vations of their stellar populations suggest, see Kriek et aT] 
I2OO6I) and observed in even rest-frame optical wavelengths 
(let along rest-frame UV), the gradients in M, /L introduce a 
bias towards smaller R^ (there may also be some related, but 
weak, biases in these stellar population parameters and stellar 
masses, see e.g. Wuyts et al. 2007). The younger stellar popu- 
lation from the starburst - i.e. the dissipational component - is 
significantly brighter in B-band at ages < 1 Gyr, so this dom- 
inates the fit, yielding a smaller R^ (equivalent to artificially 
further increasing the dissipational mass fraction - this is why 
it actually looks as if the Khochfar & Silk (2006) prediction 
provides an acceptable match to these objects). 

In line with this expectation, the sizes we obtain if we 
ignore the dissipationless components of our predicted z > 
2-3 systems are consistent with th e obser v ations from 
IZirm et al. (2007); van Dokk um et all (l2008h ; iFranx et all 
(l2008h : iDamjanov et al.l (2008). Essentially, this effect can 
yield a bias of an additional factor ^ 2 (see Figure 20 in Pa- 
per I) for gas-rich systems. It will not be a problem at lower 
redshifts, both because systems are older (the effect is neg- 
ligible at starburst ages > 1 Gyr, or even earlier, as induced 
metallicity gradients from the starburst can partially cancel 
it out), and because gas fractions are lower (we expect sig- 
nificant bias only for /dissipational ^ 0.3-0.4). Furthermore, 
there is no bias within the starburst light (there is not a strong 
gradient within the starburst itself), so for low-mass systems 
(which are largely dissipational at every redshift), there is no 
significant effect. It is precisely for large mass systems at high 
redshift that we expect this to be an important concern, and 
measurements that could ultimately estimate e.g. color gradi- 
ents and stellar population gradients in these systems will be 
necessary for more detailed constraints. 

4.1.2. Velocity Dispersions 
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Fig. 16. — Predict ed e volution in spheroid velocity dispersions with red- 
shift (style as Figure [15). Despite significant evolution in Re at fixed mass, 
the role of dark matter halos (which evolve more weakly) in setting a yields 
much less evolution in a for the same systems. Lin es are the simple predic- 
tion given the fitted scaling of Re with z (Figure [Tst and Equation Js). 

Figure [16] shows the corresponding evolution in velocity 
dispersion with redshift. The evolution is much weaker than 
might be intuitively expected: if a^ ~ GM /Re, then the factor 
^ 3 evolution in size in massive systems in Figure [15] would 
translate to factor ^ \/3 evolution in a. Instead, we predict 
such systems have roughly the same or at most a factor ^1.3 
larger a. This is because, at z = 0, dark matter plays an im- 
portant role in setting the velocity dispersion - the potential at 
r = has a contribution ^ GMhaio/^haio (here /?haio is the ef- 
fective radius of the halo, not the virial radius; /?haio ^ Rvu/c, 
where c is the halo concentration), which is comparable to or 
larger than ^ GM^jRe in most systems (although /?haio ^ Re, 
Mhaio ^ Af* by about the same factor). This is especially true 
for the high-mass ellipticals (for which the evolution in R^ is 
strongest) which, at z = 0, are the most dark-matter dominated 
within their stellar R^. At a given halo mass, i^haio evolves 
weakly - the evolution in halo concentrations almost com- 
pletely offsets the evolution in virial radii (this reflects the fact 
seen in most simulations that massive halos build inside out - 
the cen tral potential is set first and then the outer halo builds 
up; see iBullock et al.ll200H: [Wechsler et al.ll2002l) . so assum- 
ing the ratio M*/Mhaio does not evolve much with redshift 
(as inferr ed in most observations and halo occupation mod- 
els; see § 12.21 ). then the contribution of the halo to a (crhaio) 
evolves weakly at fixed M*. If anything, it can decrease - if 
the baryonic component is more compact at high-z, then the 
enclosed dark matter mass in the stellar R^ is smaller 

Roughly, then, if we consider the observed velocity disper- 
sion to reflect a galaxy potential and halo potential which add 
linearly, we obtain a^ ex (M*/7?e+Mhaio/^haio)- If we consider 
systems of fixed M*, and M\aSsSM*) and /?haio(A^haio) evolve 
weakly, then we obtain 



{a(z\M,)) _ 1 
(a(0|M,)) VT+7 



/7 + 



(Rem 

(Reiz)) ' 



(8) 



where 7 = (Mhaio/^haio)/(-^*/^f) ^ 1 -2 is the relative frac- 
tion of the central potential contributed by the dark matter at 
z = 0. For the values of 7 we estimate (^ 1 at M* ^10" Mq 
and ~ 2 at M* ~ IQ^^Mq), this simple expectation fits the 
observed trends well. 

4.1.3. The Fundamental Plane 

Combining these trends, we anticipate the evolution in 
Miyn/M^ shown in Figure [TT] Since Mdyn = ka^Re/G, the 
stronger evolution in R^ largely drives the evolution in Mdyn- 
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Fig. 17. — Predicted evolution in the ratio of dynamical mass estimator 
M(jy„ = kc r Re/G to stellar mass M, with redshift (as Figure [Tst . Compar- 
ing Figures [T5l & ll6l the evolution is primarily driven by evolving Re - more 
dissipational, compact stellar remnants are increasingly baryon-dominated 
inside their stellar Re. Again, low-mass systems are already highly dissipa- 
tional and baryon-dominated inside Re a\ z = 0, so there is little room for 
evolution. 

We emphasize that this reflects a real difference in the en- 
closed matter within the stellar R^ (as we have defined it, there 
are no significant structural or kinematic non-homology ef- 
fects). The decrease in Mdyn at fixed M» in the highest-mass 
systems is a consequence of the fact that at low-z, they have 
small dissipational content and have larger stellar R,,, so have 
larger dark matter masses enclosed in Rg (and thus higher en- 
closed mass in Re). At high-z, their progenitors are gas-rich, 
so they are formed in highly dissipational mergers and have 
compact stellar distributions, which enclose less dark matter 
in the stellar /?(.. 

Figure [18] shows the implications of this evolution for the 
FP as a function of redshift. At low redshift, the observed 
tilt {a « 0.2) is recovered. At high redshift, however, all pro- 
genitors are gas-rich, so there is not much difference in the 
dissipational content at low and high masses - meaning their 
compactness (relative to e.g. their dark matter halos or pro- 
genitor disks) is no longer a strong function of mass. The 
high-mass systems, being nearly as dissipational as low-mass 
systems, are similarly baryon-dominated in their stellar R^, 
and there is less tilt. 

At the level predicted here, it may be difficult to observe 
the predicted FP evolution. Robust velocity dispersions and 
effective radii in the same rest-frame band (avoiding a bias 
towards smaller Re as observations probe closer to the rest- 
frame UV) may be obtainable in large samples at z < 1, but 
the predicted evolution in that interval is weak - within the 
systematic uncertainties at z ~ 0. 

This is consistent with observatio nal constraints i n this 
redshift interval from weak lensing ("Hevmans et al.' |2006|) 
and optical studies (di Serego Alighieri et al. 2005; Tr eu et al.l 
2005; van der Wei et al. 2 0051 : Ivan D okkum & van der Marell 
2007; .Brown et al...2008.) . These observations do see evolu- 



tion in optical bands, (in the opposite sense to that predicted 
here, namely an increasing appare nt tilt), but find that this 
owes to stellar population effects (Idi Serepo Ahghieri et al.l 
l2005h - the evolution observed in terms of Mdyn and M» 
is either negligible or slightly negative (as predicted here). 
Any model where the stellar populations of lower-mass el- 
lipticals are systematic ally younger (as is predicted here; see 
iHopkins et al.l (l2008cl) for details) will predict the observed 
trend in optical bands: if, say, a low-mass system at z = has 
a stellar population age of ^ 7Gyr whereas a hi gh-mass sys- 
tem has an age of '^ 12Gyr, as observed (see e.g lTrageretal.1 
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Fig. 18. — Best-fit fundamental plane (Mdyn oc MJ^" where a is the FP 
tilt) as a function of redshift. The evolution in Figure FTTl is reflected here: at 
high-z, even large-mass disks are gas-rich so high-mass systems experience 
significant dissipation, bringing their stellar Re in and making the remnant 
more baryon-dominated within Re (lowering the dark matter fraction and 
therefore Mdyn within Re. at fixed M»). The tilt therefore weakens slightly 
with redshift (there is less difference in the degree of dissipation - the driver 
of tilt - at high-z). Solid and dashed lines allow (do not allow) for progen- 
itor disk size and stronger (weaker) disk gas fraction evolution, bracketing 
the range allowed by observations (representing the uncertainty in our pre- 
dictions). 



2000 [ iNelan etaH I200I iThomas et all I200I iGaUazzilFaD 
20061) . then inverting passive evolution to z = 0.7, the low- 
mass (younger) system would have a lower M^/Lb by a fac- 
tor > 10, whereas the high-mass system would have increased 
M^JLb by a factor of just w 2. It is therefore challenging to 
observe the weak evolution here (^ 0.1 dex), given expected 
stellar population effects of magnitude ^0.5-1 dex. 

However, this degree of evolutio n is importan t for a num- 
ber of subtle effects: for example, van DokkumI (i2008) uses 
the evolution of the optical FP with redshift to infer evolution 
in elliptical mass-to-light ratios (under the assumption that the 
physical - i.e. stellar mass - FP is invariant). This is broadly 
fine, as the optical MjL for any reasonable stellar population 
age evolves much more rapidly than the physical FP evolu- 
tion in Figure [18] But the authors then compare this change 
in MjL with time to the mean colors of ellipticals as a func- 
tion of time, and argue that the relation between the two sug- 
gests evolution in the stellar initial mass function. At this level 
(as the authors acknowledge), the comparison is sensitive to 
evolution in the physical (stellar mass) FP at the ^ 0.1 dex 
level - if massive galaxies at z ~ 1 have lower Mdy n jM^ by 
~ 0.1 dex (similar to what we predict in Figure [TTD this will 
bias the inferred MjL ratios (under the assumption of no phys- 
ical FP evolution), but not, obviously the galaxy colors, by an 
amount comparable to the significance of the effect seen (in 
other words, allowing for this evolution, the evidence for any 
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Fig. 19. — P redi cted evolution in Mbh at fixed stellar mass with redshift 
(style as Figure [Tsl . Red circles with arrows sho w the upper limits from ob- 
served evolutio n in spheroid mass density in Hopkins et al.l l2006cD (see also 
IMerloni et alJ2 004). applicable to ~ M, galaxies. Blue squares show the ob- 
servational estimates of evolution for massive black holes and galaxies from 
IPeng et al. (2006). More dissipation at high redshifts means more work for 
the black hole to do before its growth self-regulates, giving rise to larger Mbh 
(for fixed feedback efficiency); however, the net effect is relatively weak. Se- 
lection effects (Lauer et al. 2007a) may explain the small difference between 
Mbh/M, observ ed a nd predicted. The trend is mass-dependent in the same 
manner as Figure [Ts] 
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Fig. 20. — Predicted e volu tion in A/bh at fixed velocity dispersion a with 
redshift (style as Figure [Ts). The complex interplay between evolution in 
MbH: Re, and cr yields weak trends here that are not necessarily monotonic. In 
general the predicted Mbh/"" evolution is weaker than (or inverse to) that in 
Mbh /M, , but at z ~ 2 - 3 massive black holes may be overmassive relative to 
cr by a factor ^2-3 (owing to the evolution in Mbh/M« and weak evolution 
ina/Mt). 



evolution in the stellar IMF may be less strong). 

At this level of detail, physical evolution in the FP in 
the manner predicted here may change this comparison, and 
should be considered in future, more detailed comparisons 
with observations. However, the effects here may not be im- 
portant so_Jong_asthe_observed sample is appropriately se- 
lected: |vinDo^^^(|2008i) focus on satellites in rich clusters, 
which are not likely to undergo significant subsequent merg- 
ing (especially as compared to the central galaxies modeled 
here, particularly those in moderately dense field and group 
environments that contribute much of the "lever arm" to the 
FP tilt evolution predicted). This emphasizes the importance 
of appropriate sample selection when comparing these quan- 
tities, and the need for further study to determine exactly how 
our predictions generalize to populations with more complex 
or distinct histories (such as satellites) and therefore pertain 
to the observations. 

4. 1 .4. Black Hole Masses 



Figure [19] & |20] show the evolution in BH masses at fixed 
stellar mass and velocity dispersion, respectively. Because 
systems are more dissipational at high redshifts, the binding 
energy of the baryonic material which must be supported or 
expelled to halt accretion is higher (at fixed stellar mass). As 
a consequence, the typical BH masses are larger We dis- 
cuss this evoluti on, its physical causes, and consequences, in 
greater detail in iHopkins et alj (l2007bl) . Here, we note that 
the same conclusions hold, adopting a more complex cosmo- 
logical model than was considered in that paper. 

The evolution is sim ilar to tha t suggested by a number of re- 
cent observations (e.g.lShields'e t al. 2006; W alter et alJl2004l: 
iPeng et alJ l2006t |Woo et al.' ^2006; Salviander et al.' i2006b 
and indirect estimatio ns (Hopkins et al. 2006c; Merloni et al] 
■2004"; 'Adelberger & Steidel 2005; W vithe & Lo eb 2003 
Hopkins et al. 20 07d; Lidz et al. 2006) but we emphasize that 
it is not strong (it is in fact on the lower ~ Icr end of the 
estimated evolution) - typical BHs are a factor < 2 larger at 
z ^ 2-3. Larger differences from the local relations (in partic- 
ular in very luminous quasar populations) may result from se- 
lection biases (Lauer et al. 2007a), where the most luminous 
quasars (most massive BHs) are likely to represent the upper 
end of the scatter in the BH-host correlations. The combina- 
tion of such a selection effect with o ur predicted evolution is 
a very good match to the evolution in lPeng et al.l (|2006|). 

We also compare with the constraints in iHopkins et alJ 
(l2006c), derived from observations of the evolution in the 
spheroid mass density with redshift (since the black hole mass 
density cannot, in any reasonable model, decrease, the maxi- 
mum evolution in the typical ratio Mbh/A^* is limited by the 
fact that the resulting black hole mass density, predicted from 
the observed spheroid mass functions, must not be higher than 
that observed at z = 0). This constraint is most directly ap- 
plicable to ^ L^, galaxies, since this is where most of the 
mass density of the universe resides. These constraints limit 
evolution to a factor < 2 at z = 2, consistent with our pre- 
dictions. Comparing to similar estimates from lMerloni et alJ 
(J2004i), who adopt a similar approach but model the evolu- 
tion in the black hole mass density from quasar luminosity 
functions, yields a similar constraint and expectation of weak 
evolution, Mbh /A^* ^ (1+z)""*""'^, consistent with our predic- 
tions. 

The evolution in Mbh/c is somewhat more complex, re- 
flecting the full interplay between evolution in Mbh with 
spheroid binding energy and evolution in a with redshift (dis- 
cussed above). Because deeper potentials will, in general, be 
reflected in higher a values, we expect weaker (or even in- 
verse) evolution here, relative to Mbh/M,^ (where M^, was not 
a measure of the central potential depth, so deeper potentials 
at fixed M* translated to higher Mbh)- However, because a 
reflects a combination of the halo and galaxy over a signif- 
icant dynamic range, the results are non-trivial and can be 
non-monotonic in redshift. For example, in our most mas- 
sive bin, the predicted evolution in black hole masses is suf- 
ficiently strong (and evolution in a relatively weak as a re- 
flection of the tradeoff between dark matter and baryonic po- 
tential) that evolution to larger Mbh/c (albeit at the factor 
< 2 - 3 level) may be expected, consistent with rec ent obser- 
vations dShields et alJl2006l;[Salviander et alj|2006l) . although 
again selection effects are probably important and may ex- 
plain cases where the inferred evolution appears to be much 
larger. 

4.2. Effects of Dissipation 
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Fig . 21. — Predicted evolution in spheroid sizes with redshift, as Figure fTs] 
but in a model with no dissipation (gas is treated the same as stars in mergers). 
Even allowing for evolution in disk sizes with redshift, there is essentially no 
evolution predicted in spheroid sizes without dissipation (other effects can 
make the evolution in spheroid sizes weaker than that in disk sizes, which is 
already observed to be weak). 



Fig. 23. — Predicted evolution in spheroid sizes with redshift, as Figure fTsl 
but in a model with no dry mergers (structural change in spheroid-spheroid 
mergers is suppressed). The predicted evolution is essentially identical at 
M, < 1O"M0, but weaker for very massive systems, which can no longer 
be puffed up at low redshift to match observations. 




Fig. 2 2. — Predicted evolution in A/bh at fixed stellar mass with redshift, 
as Figure [19] but in a model with no dissipation. The predicted evolution for 
massive black holes is almost entirely a consequence of dissipation. 

We have argued that these effects are largely driven by vary- 
ing degrees of dissipation as a function of redshift. We briefly 
illustrate this here by repeating our predictions for the size 
evolution of ellipticals (which, as we showed in § ID is the 
most pronounced evolution) in the version of our model where 
we ignore dissipation or treat gas identically to stars (as in 
§ |3.1.2| i. Figure|2T|shows the results. Without evolution in gas 
fractions driving evolution in the degree of dissipation, only 
the (weaker) evolution in progenitor disk sizes has a notice- 
able effect, and the evolution is substantially suppressed. In 
fact, the predicted evolution can, in this case, be even weaker 
than that in the progenitor disks, because ellipticals with a 
given final mass tend to undergo their mergers over a simi- 
lar redshift range - so their final sizes will reflect whatever 
the sizes of disks were at that time, whether or not those disk 
sizes evolved later. 

A similar (relative) effect is seen for the evolution in Mdyn 
and the FP, as well as BH masses. We show the evolution in 
black hole mass at fixed stellar mass predicted without dissi- 
pation in Figure |22] As expected based on the evolution in 
Re, the evolution here is essentially eliminated without dissi- 
pation. 

4.3. Effects of Dry Mergers 

At the high-mass end, there is also some evolution because 
systems merging early will have dry mergers puff up their 
sizes at later times. Figure |23] shows the evolution in the size- 



mass relation, with dry mergers suppressed (as in § 13. 2b . The 
difference here is real, but small. Dry mergers - while impor- 
tant for the size evolution of individual objects - do not have 
a dramatic effect on the overall evolution of the size-mass re- 
lation. If the "pre dry-merger" size-mass relation is a power- 
law. Re ex M", then by the energy conservation argument in 
Equation (|4]l, a merger with mass ratio / (/ < 1) of two sys- 
tems initially on this correlation will have a final effective ra- 
dius a factor (1 +/)^/(l +/^~") larger than the primary. Rela- 
tive to the size-mass relation, the system has moved above the 
relation by a factor (1 +/)2""/(l +/^""). 

For observed values of ot in disks or ellipticals (^ 0.3 and 
~ 0.6, respectively) this amounts to only ^0.1 dex evolution 
per major (/ > 1/3) merger, compared to e.g. ~ 0.3 dex scat- 
ter in the correlation. This is why, in FigurefTT] the absence of 
dry mergers makes a difference only at the highest end of the 
mass function, where the number of major dry mergers might 
be large. Most of the population did not experience substan- 
tial dry merging, and what did, experienced it mainly at late 
times. As a result, the inferred redshift evolution without dry 
mergers is largely the same, with only an offset in the rate of 
size evolution at z < 1-2 for the most massive systems (lead- 
ing to an offset in the relative size prediction at z ^ 3). 

It should be emphasized that the difference seen at the high- 
est masses and redshifts in Figure |23] pertains by definition 
to those ellipticals which are already that massive at those 
redshifts - i.e. systems which have formed and assembled 
^ 10'^ Mq by z > 2-3. These systems obviously live in the 
most dense environments, and are very rare - their evolution 
is strong because they are expected to undergo a large number 
of dry mergers, as well as mergers with later-forming gas- 
poor disks and spheroids and minor mergers (see §|5] below) 
that will build up an extended envelope and rapidly increase 
their sizes. The typical system with the same mass observed 
at lower redshifts was not fully assembled at these early times, 
but forms in less overdense regions and (although star forma- 
tion in progenitors may have ceased at early times) assembled 
more recently, in a smaller number of dry mergers. So the 
evolution seen should not be taken to imply a large number of 
dry mergers for the average massive galaxy, but rather to im- 
ply a large number of subsequent mergers ("large" being ~ a 
few; again see §|5]for discussion of how this can efficiently in- 
crease the spheroid size) for, in particular, galaxies in extreme 
environments that are already massive at early times. 
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5. EVOLUTION OF INDIVIDUAL SYSTEMS: WHAT HAPPENS TO 
THE SYSTEMS FORMED AT HIGH REDSHIFT? 

We have predicted that high-z massive ellipticals should be 
substantially more compact than low-z ellipticals of the same 
mass, in apparent agreement with observations. If this is true, 
then what happens to these early-forming, compact ellipticals 
as they evolve to z = 0? 

We are now interested in the evolution of individual sys- 
tems, from their formation redshift to z = 0, rather than the 
evolution of the population at the same mass (recall, the sys- 
tems with M* r^ 10" Mq at z ^ 2-3 are not, generally, the 
same galaxies that have M* ~ 10" M0 at z = 0). Figure l24l 
shows such a case study, in fact a set of them, tracking the 
median (averaged over many Monte Carlo realizations) evo- 
lution in sizes of individual systems that are formed at a given 
redshift and will have a specific stellar mass at z = 0. For two 
z = stellar masses where evolution is significant, we show the 
evolution in effective radii, velocity dispersions, and stellar 
masses. Figure |25] shows the same, but projects a few of the 
representative median tracks (for early-forming systems) into 
the size-stellar mass plane, to show how the systems move as 
a function of time and merger history in this space relative to 
e.g. the observed z = correlation. 

The systems which have their first major merger at some 
early time are indeed much more compact than their z = 
descendents. Partly, this is because they begin life as lower- 
mass systems, but even for their (instantaneous) stellar mass 
they are more compact than z = objects of that mass, re- 
flecting the higher degree of dissipation (the evolution in Fig- 
ure [T5]l. However, by z = 0, systems of the same stellar mass 
from different formation epochs all have effective radii within 
a narrow range (a factor of ~ 2). The systems which form 
late (z < 1) evolve weakly - they rarely experience any dry 
mergers, so their z = sizes largely reflect their sizes at for- 
mation. However, the systems which form early (z > 3) evolve 
strongly - they experience a number of both mergers with 
lower-redshift, larger and much less gas-rich disks, as well as 
other spheroids (true spheroid-spheroid dry mergers). Such 
mergers increase the sizes of the ellipticals both in absolute 
terms and in relative terms (moving them above the size-mass 
relation representative of the time when they formed). Al- 
though the number of major mergers for most systems is rel- 
atively small, it is a function of their formation time. The 
systems which first merge at earliest times do so because they 
live in highly biased environments - they are precisely those 
expected to undergo the most dry merging. As a result, the 
effects of evolving dissipational content and the effects of dry 
mergers appear to conspire such that, despite a significant evo- 
lution in the size-mass relation with redshift, by z = ellipti- 
cals formed at early times will have broadly similar sizes to 
those forming around z '^ 0. 

If we examine Figures l24l|25l we can plot out the 
likely course of evolution for the compact, massive 
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passive galaxies seen at z ^ 3 in 
Zirm et all ('200?): 'Labbe et al.' (^2005^; 'Wuvts et al." ("2007^ 
Daddietal., (2005); van Dokkum et al. (2008); Franxetal] 
(I2OO8I) ; iDa'mianov et all (2008). At the redshifts observed, 
these are systems which have formed recently (systems which 
form earlier will be even more rare) in a gas-rich merger 
(in agreement with their young observed ages, relative to 
z = spheroids of the same mass; Kriek et al. 2006), very 
high gas fractions > 30-40% characteristic of disks at these 
and higher redshifts. The large dissipational fraction yields 



a massive galaxy (M* > 10"Mq) with a very small effec- 
tive radius ^ 1 kpc. As discussed in § |4] above, however, 
the velocity dispersion is not extremely large (if we took 
a{z) = cr(M*,0) X Re(Q)/Re(z), we would estimate a ^ 600- 
800 km s"'; however, the predicted velocity dispersion is in 
fact ^ 250-300 kms"'). Because the system is so much more 
compact, the dark matter fraction within R,, is much smaller 
than in a typical z = spheroid of the same mass, so its con- 
tribution to the velocity dispersion is much smaller. 

The system, having formed in a high density environment 
(consisten t with then' observed clustering and number den- 
sities; e.g. lOuadri eTaI]|2007t Ivan Dokkum et al.ll2006l) . will 
undergo a significant number of mergers between this time 
and z ~ 1-2, where the merger rate begins to level off. 
These mergers are with a mix of disks and ellipticals. At 
times shortly after formation, successive mergers may be with 
other gas-rich disks, yielding similar amounts of dissipation 
and keeping the system relatively compact. This is illus- 
trated in Figure|25]as a "similar gas-rich merger"; which will 
move the system parallel to the size-mass relation (R^ oc M"*'). 
Soon, nearby systems will (given the biased environment) be 
largely elliptical - but they may be similar, compact ellipti- 
cals. Spheroid-spheroid dry mergers will puff up the system, 
moving it up somewhat relative to the initial size-mass rela- 
tion at formation. This is shown as a "dry merger with similar 
(compact) elliptical" in Figure |25]- here merging two identi- 
cal (equally compact) ellipticals doubles both R^ and M* (see 
§ 12.4. It , moving the system "up" relative to the size-mass re- 
lation along the steeper axis R^ ex M'*^. 

As time goes by (in a hierarchical scenario), the galaxies 
merging will have formed in a more extended physical region 
over longer periods of time (being recently accreted into the 
parent halo); merging disks at later times will be progressively 
more gas-poor (characteristic of low-z disks, which have con- 
sumed their gas in star formation), and thus contribute dissi- 
pationless stars with very large effective radii, building up the 
stellar component (an extended, violently relaxed envelope) at 
large radii. Likewise, pure spheroid-spheroid mergers will in- 
clude spheroids that formed later from such less gas-rich disks 
(and are therefore less compact), and these will serve to puff 
up the system even more efficiently and build up an envelope 
consistent with that typically observed in BCGs at z = (the 
expected location of such systems at z = 0, based on their clus- 
tering properties and inferred halo masses). This is denoted 
in Figure |25] by the label "dry /mixed mergers with gas-poor 
disks/later-forming (less compact) ellipticals," and allows the 
system to move more rapidly "up" in size relative to the size- 
mass relation (along an axis Rg oc M'"*"'*^, depending on the 
details of the type of mergers involved). This enables sys- 
tems that form compact for their mass to migrate to the z = 
size-mass relation in a relatively small number of mergers (in 
the illustrative example in Figure |25] the system moves from 
being 1 /4 the size of a z = analogue to lying on the local re- 
lation in just two equal-mass dry mergers, one with a similarly 
compact elliptical, one with a later-forming, less compact el- 
liptical; the latter merger builds a more extended envelope and 
therefore moves the system more rapidly towards agreement 
with the z = relation). 

At sufficiently large masses (especially for galaxies at the 
center of massive clusters), minor merge rs will in creasingly 
dominate th e growth of the galaxy (see e.g. lMaller et al. 200q; 
IZheng et alJIJOOTt lMas)edi et alll2007l) ; these will have sim- 
ilar effects to those described here, and may even further in- 
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Fig. 24. — Median size (left), velocity dispersion (middle), and instantaneous stellar mass (right) as a function of redshift, for various galaxies that have similar 
z = stellar mass (labeled). These median tracks are shown for systems that have their first major merger (that first form a spheroid) within narrow intervals in 
z (shown where each line begins: e.g. the line beginning at z ~ 3.5 shows the median track of all systems which have their first major merger at z, = 3.5 it 0.25 
and will have the same labeled stellar mass at j = 0). Different colors show different variations in our model, and reflect the uncertainties in redshift evolution (as 
the shaded range in Figure [Tst . Systems which form early are relatively compact, but grow in mass owing to subsequent mergers with gas-poor disks and other 
spheroids. 



crease the size at z = (especially if small merging systems 
are disrupted as they merge; in this case they will not con- 
tribute much to the central structure of the galaxy, but will in- 
creasingly build up an extended envelope at large radii). More 
detailed modeling of the structure of e.g. BCGs and the most 
massive (M* > lO'^M©) galaxies should account for both ma- 
jor and minor mergers in dense environments. 

The effective radius therefore rapidly grows, and by z = 
the system lies on (or even somewhat above) the median size- 
mass relation for systems of the same z = stellar mass. The 
velocity dispersion grows slightly as mass is accumulated, 
but since most of this mass is contributed from dissipation- 
less components to the extended envelope (recall, a merger of 
two identical spheroids will leave a unchanged), it does not 
contribute much, and the z = galaxy has a peak (central) ve- 
locity dispersion w 400 km s"', large but only 30% larger than 
its initial central velocity dispersion and completely consis- 
tent wit h those observed in the most massive galaxies today 
dBemardi et a l. 2006). In detail, in fact, the predicted descen- 
dants of early compact systems here have similar abundances, 
velocity dispersions, and remarkably similar predicted loca- 
tions in e.g. the z = size-mass, fund amental plane, and Faber- 
Jackson relations to the sample in Bern ard! et alj ([2006), sug- 
gesting that many of those systems may be the products of 
this process. For the most part, then, these extreme systems 
are completely consistent in all the properties we can predict 
here (effective radius, mass, velocity dispersion, black hole 
mass, and profile shape/Sersic index) with their constituting 
the central ^ 10-30% of the mass in a significant fraction of 
the brightest group or cluster galaxies today. 

This can explain why such objects are not ubiquitous in the 
local Universe. However, we could still ask what fraction of 
such systems might be expected to survive to z = 0, without 
disappearing into a larger z = galaxy. In short, should any 
of this massive and compact population exist at z = 0, which 



might represent the direct remnants of high-redshift, compact 
ellipticals? 

Figure|26]shows the fraction (at a given z = stellar mass) of 
objects which had their last major merger of any kind (which 
will largely set their observed size) at (or above) a given red- 
shift. At masses corresponding to those observed in high- 
redshift compact galaxy populations (~ 10" Mq), the fraction 
of the population today which has survived relatively intact 
since z > 2 is expected to be small, ^ 1 - 10%. This translates 
to a z = space density of such systems ~ lO'^^Mpc"^: not 
large, but not completely negligible. This is an upper limit 
to the number density of such systems: if they have experi- 
enced a sufficiently large number of minor mergers, they may 
be significantly altered by z = (although many of the same 
reasons that allow some systems to avoid major mergers will 
allow them to avoid a large number of minor mergers). At any 
redshift, however, there is a scatter in elliptical sizes owing to 
a scatter in e.g. the dissipational fractions at the time of their 
formation (reflecting scatter in disk gas fractions). Since disk 
gas fractions have significant (factor ^ 1.5-2) scatter, that is 
actually expected to dominate the scatter in e.g. the size-mass 
relation at z = (rather than scatter in the formation times 
of ellipticals). That is to say, there could be a similar frac- 
tion of ellipticals that are too compact present at z = which 
formed recently (not at z > 2), from the most anomalously 
gas-rich disks at lower redshifts (say the ~ 2 ct outliers in the 
disk gas fraction-mass relations). This probably will not ex- 
plain systems as small as ^^ 1 kpc at these stellar masses, but 
it illustrates that some additional indicator is needed to distin- 
guish those possibilities. If stellar population ages could be 
measured, this is straightforward: the system which formed 
at low redshift will have significant recent star formation (in 
fact, quite a lot, since we are interested in compact systems 
with large dissipational fractions). So it should be possible, 
in principle, to identify the descendents of the early-forming 
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Fig. 25. — Tracks followed by early-forming systems from Figure l24l in 
the size-stellar mass plane. Top: Symbol shape denotes the z = stellar mass 
(labeled), and color denotes redshift (from black at z = to red at z = 4, as 
labeled). Points show the median tracks as in Figure |24]for systems which 
first formed at a given redshift (the beginning of each plotted track). Lines 
show the z = size mass relation (solid), with slope Re oc M^^, and the 
same divided by a factor of 2 (dashed) and 4 (dotted). Bottom: Vectors il- 
lustrate motion in the plane owing to different types of mergers, for a system 
that will have a mass M» ~ IO'^Mq at z = but first forms as a compact 
M« ~ 1O"M0 spheroid (~ 1/4 the size of a typical similar-mass system at 
z = 0) at z ~ 3 — 4 in a gas-rich merger. As most systems are still gas-rich 
at these times, the next merger (z ~ 3 — 4) may be similarly gas-rich, mov- 
ing the system along the Re oc A/J'* slope (maintaining the relative compact- 
ness; red). In biased regions, nearby systems are also rapidly ti'ansformed 
into ellipticals (which, having their initial mergers at early times from gas- 
rich disks, are also compact). A 1:1 merger with an identical elliptical will 
double Mf and Re (Re oc M' "; green). At later times, systems merging will 
be newly accreted (into the halo), evolved (relatively gas-poor) disks, or el- 
lipticals made at later times from such evolved disks (which, being formed 
in more gas-poor mergers, will be less compact). Such a dry/mixed merger 
(with a less compact system) moves the system up more rapidly than dry 
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ellipticals. 

One important caveat to this discussion is that we have fo- 
cused here on the role of major mergers with mass ratios be- 
low 1:3. Our calcu lation and other halo occupa tion models 
(IZheng et al.l 1200 71). clustering meas urements (,Masiedi et aP 
l2007h . and cosmological simulations (iMaller et al.l2006h sug- 
gest that as galaxies approach the extreme M, ^ lO'^M© 
BCG populations, growth by a large series of minor mergers 
becomes progressively more important than growth by ma- 
jor mergers. Our predictions in this limit should therefore be 
treated with some degree of caution, in contrast with the bulk 
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Fig. 26. — Fraction of z = spheroids of a given stellar mass which had 
their last major merger (gas rich or gas poor) above a given redshift. A few 
percent of massive systems today are expected to have survived from a much 
earlier formation time z ~ 2 — 3 without any subsequent mergers modifying 
their structure - i.e. be remnants of the era where scaling relations were dif- 
ferent. These ellipticals would generally be compact for their mass, with old 
ages, and strongly clustered. 



of the elliptical population at masses < a few L*, the vast ma 
jority of which still live in less extreme environments (e. 
iBlanton et al! l2005t Imiig et all 120061: iMasjedi et all l200 
and are expected to have experienced a small number (^ a 
couple) of major re-mergers since redshifts z ~ 2 - 4. Our 
modeling can, in principle, be extended to include minor 
mergers, and doing so we find qualitatively similar conclu- 
sions (because merger histories are roughly self-similar, the 
relative growth and courses of evolution seen in Figure |24] 
are conserved). However, as demonstrated in Hopkins et al] 
(|2008c), our predictions (and those of other halo occupation- 
based approaches) become considerably less robust in the 
regime of minor mergers (mass ratios ^^1:10 or so), owing 
to greater uncertainty in the satellite mass function at these 
mass ratios and weaker observational constraints on the halo 
occupation of small galaxies. 

In any case, because early-forming, compact systems were 
massive and experienced mergers at such early times, many 
will represent the primary galaxies in the highest density 
peaks, being BCGs today. Indeed, this is true for most of 
the massive high-z spheroid population. However, in order 
to survive to z = from this population, mergers must be 
avoided (by our definition) - something very hard for BCGs. 
We would still expect to find such objects around high-density 
peaks (there is no getting around the fact that they will only 
form so early and so massive in high-density peaks), but those 
that survive, in many cases, would be expected to do so pre- 
cisely because they were not the central galaxy or BCG. (In 
detail, they probably were the central galaxy or BCG at their 
time of formation, but were displaced from this position in 
subsequent halo-halo mergers, becoming a satellite in a still 
larger system, suppressing their merger history). 

There may be some interesting candidates for such objects 
among the so-called "compact elliptical" family. For ex- 
ample (although we intend this only as an illustrative case), 
NGC 4486B is a satellite of NGC 4486 in Virgo. It has 
a stellar mass > IQ^^Mq (Paper II) and velocity dispersion 
a = 200kms~', but an effective radius of only 200 pc (see e.g. 
iKormendv et al.l 12008!) . This is somewhat smaller than the 
~ 1O"M0 which we just described, but the scenario can be 
generalized to masses about this low. The stellar population 
age is quite old, 10.5 Gyr (pe rhaps even up to ^ 1 3 Gyr, de- 
pending on the indicator used: ICaldwell et al.l2003h . implying 
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a time of last significant star formation z > 2. This is further 
supported by a corresponding a-enrichment and metallicities. 
The system has high rotation and ellipticity suggesti ve of a 
very gas-rich formation (Bender 1988; Bender etal. 19921) . 
The estimated dynamical friction time is sufficiently long that 
the system could have become a satellite in the Virgo pro- 
genitor as early as z ~ 2 and still not merged with the central 
galaxy. All of these lines of evidence suggest it may have 
survived from an early formation time as a very compact sys- 
tem. Unfortunately, except for the direct stellar population 
age measurements, these arguments are mostly circumstan- 
tial, and those stellar population estimates are still quite un- 
certain. Better determinations of the star formation history in 
this object would be an important test of this hypothesis. 

6. CONCLUSIONS 

We have combined the results from a large library of nu- 
merical simulations with simple, observationally constrained 
models for the halo occupation (and corresponding merger 
history) of galaxies in order to predict how spheroid scaling 
laws evolve. We demonstrate that, despite a variety of pro- 
genitor properties, complex merger histories, and evolution in 
e.g. progenitor sizes and gas content with redshift, the most 
important property driving the structural scaling relations of 
spheroids at each redshift (and, in particular, making these 
scaling relations different from those obeyed by disk galax- 
ies) is the amount of dissipation involved in the formation of 
the spheroid. At the highest masses, dry mergers at relatively 
late times are also important, but are few, consistent with ob- 
servational constraints. Together, these predict a significant 
mass-dependent evolution in spheroid scaling laws, which has 
important consequences for the fundamental plane, spheroid 
densities, and the formation histories of massive galaxies. 

At z = 0, a simple model accounting for the effects of 
dissipation in mergers is able to explain the size-mass, ve- 
locity dispersion-mass, and fundamental plane (dynamical- 
stellar mass) correlations, the relation between galaxy pro- 
file shape (Sersic index) and mass, the correlation between 
black hole mass and various host properties (bulge binding en- 
ergy, mass, velocity dispersion, and profile shape), the relative 
mass fractions in compact dissipational components in ellip- 
ticals, and the abundance of cuspy/disky/rapid ly rotating ver- 
sus co red/boxy/slowly rotating ellipticals (see iHopkins et al] 
l2008d) . Dissipation in the initial disk-disk mergers that form 
the spheroid progenitors is the most important parameter con- 
trolling these relations. 

In particular, at high masses, where disks are gas-poor, the 
sizes, densities, and dynamical mass to stellar mass ratios of 
ellipticals reflect those of their progenitor disks. At lower 
masses, where disks are gas-rich, mergers involve a signif- 
icant amount of new star formation in dissipational nuclear 
starbursts triggered by loss of energy and angular momentum 
in the gas in the merger, building up a compact central stel- 
lar distribution (the "dissipational" component) that yields a 
smaller, more dense remnant. This mass-dependence in the 
dissipational content of progenitor disks makes the size-mass 
relation of ellipticals steeper than that of disks and gives the 
observed tilt in the fundamental plane (essentially the state- 
ment that low-mass disks, being more dominated by this nu- 
clear dissipational component, have higher baryon-to-dark 
matter fractions within their stellar effective radii). Black 
holes are formed with a mass that, in a simple feedback- 
regulated model, corresponds to the binding energy of the 
dissipational component of bulge at the time of this dissipa- 



tion, which the black hole must work against in order to self- 
regulate its growth. This matches th e detailed observations 
of a black hole fundamental plane in iHopkins et al.l (l2007d) 
and gives rise to the secondary (indirect) correlations between 
black hole mass and spheroid velocity dispersion, mass, size, 
and Sersic index (all of which agree with those observed). 

To lowest order, the dissipational fractions that give rise 
to these trends at z = reflect the observed gas fractions of 
disk galaxies at moderate redshifts z ~ 0-2. This is because 
most ellipticals are formed in a relatively small number of 
major mergers at low redshifts in this range. It is only when 
one considers the most massive populations of e.g. BCGs and 
massive cored, boxy, slowly rotating ellipticals that dry merg- 
ers become important. Their effect is to puff up these sys- 
tems, producing extended envelopes and raising their outer 
Sersic indices. This is sufficient to explain the observed weak 
dependence of outer Sersic index on mass in elliptical galax- 
ies, following the detailed multi-component decompositions 
of observed systems in Paper II and Paper III. When systems 
are fitted to a single Sersic index, the result is a steeper depen- 
dence that reflects a complex combination of e.g. the outer 
Sersic index (primarily driven by merger history), the mass 
fraction in a dissipational component (yielding rising densi- 
ties at small radius), the shape of the two components where 
they are comparable, and the dynamic range of the fit (since 
real ellipticals are not perfect Sersic profiles, there is some de- 
pendence, and in particular when fitting the entire profile to a 
single Sersic law, the dependence can be significant). 

The detailed interplay of dissipation and dry mergers also 
gives rise to important residual trends that can be used as tests 
of these models. For example, at a given stellar mass, the old- 
est systems will have the largest effective radii and dynamical 
masses. This is driven by two effects: first, disks which have 
high star formation efficiencies (earlier star formation times) 
will be more gas-poor at the time of their mergers - i.e. will 
have less mass in a dissipational component. Second, systems 
which may be gas-rich but form very early (i.e. have a high 
redshift of their last gas-rich merger) tend to do so because 
they are in dense environments, and will therefore experience 
subsequent mergers with both more gas-poor disks and other 
spheroids, increasing their size and dynamical-to-stellar mass 
ratios. 

At higher redshifts, disk gas fractions are expected and ob- 
served to be systematically larger. As a result, spheroids ob- 
served at these redshifts are expected to have formed in more 
gas-rich mergers and to have a larger mass fraction in their 
dissipational component, making them more compact. We 
predict that elliptical sizes evolve in a mass-dependent fash- 
ion: since low-mass disks are still quite gas-rich at z = 0, there 
is little room for them to become more so at high redshift, 
so they cannot evolve much. High mass disks, on the other 
hand, could in principle be much more gas rich than their ob- 
served z = < 10% values. As well, at high masses, dry merg- 
ers become important in the predicted size evolution. Conse- 
quently, the highest mass ellipticals are expected to evolve 
to become a factor '^ 2 - 3 smaller at z > 2. In terms of the 
mean effective radius of spheroids of a given stellar mass, rel- 
ative to that at z = 0, we find evolution of the form ( 1 + zT^ 
where f3 ^ 0.23 log(M^,/lO^M0) (of course enforcing /3 > 
or whatever minimum evolution is set by observed evolution 
in progenitor disk sizes at these masses). 

This is consis tent with o bservations f rom e.g.lTruiillo et al.l 
(2006i): Mcintosh et all (2005); I Zirm et all (l2007h; 
Ivan Dokkum et al] (l2008l) . although we highlight impor- 
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tant biases arising from stellar population gradients which 
are relevant when attempting to estimate the sizes of the most 
massive galaxies at z > 2. We demonstrate that this size 
evolution owes almost entirely to the change in dissipation 
with redshift, although dry mergers are of some importance 
for the most massive systems. Whether we explicitly include 
it or not, the evolution cannot be explained by simple scaling 
of disk sizes with redsh ift (which, in any case, is observed 
to be weak; see e.g. Somerville et a l. 2008b, and references 
therein). 

This evolution has important implications for the other fun- 
damental plane correlations. Because the fundamental plane 
tilt (at z = 0) arises owing to systematically higher dissipa- 
tional fractions in low-mass systems (making them more com- 
pact and yielding lower dark matter fractions within the stellar 
Re), at high redshift (where even high-mass systems must be 
highly dissipational, since disk gas fractions are much higher) 
we expect the tilt to be weaker. Note that we refer here to the 
stellar mass fundamental plane, i.e. Mdyn versus M» - there 
will of course be stellar population effects introducing nor- 
malization and tilt changes in various observed bands. The 
predicted change is within the z = uncertainties at z < 1, but 
by z > 3, the tilt is predicted to largely disappear. This is im- 
portant both as a test of these models, but also as a caution 
to observational attempts to use the fundamental plane as a 
detailed stellar population probe. It is certainly much less sig- 
nificant than the expected evolution in mass-to-light ratios in 
optical bands, but could be im portant for de tailed stellar pop- 
ulation probes such as that in van DokkumI (|2008) which as- 
sume a fixed stellar mass fundamental plane (although selec- 
tion of e.g. satellite galaxies in clusters, which are less likely 
to undergo significant future merging relative to central galax- 
ies in the field or groups, may be sufficient to mitigate these 
evolutionary effects). 

We argue that velocity dispersions will change weakly with 
redshift, despite the evolution in effective radii, because sys- 
tems with higher dissipational fractions (smaller R,,) are more 
baryon-dominatedin their stellar effective radii (i.e. the veloc- 
ity dispersion includes a weaker contribution from the halo). 
This is important, because velocity dispersion is in general 
preserved or increases in subsequent re-mergers. If high- 
redshift massive galaxies had extreme velocity dispersions 
> 600km s"' (corresponding to the simple assumption that a 
is inversely proportional to just the stellar effective radius), 
then the appropriate number of systems with equal or higher 
velocity dispersions would have to exist at z = (and they 
do no t, even in survey volumes of the SDSS; iBernardi et al.l 
120061) . However, accounting for the interplay between dark 
matter and baryonic mass, our predicted evolution in velocity 
dispersions is completely consistent with observational con- 
straints. 

Furthermore, because of the increasing dissipational com- 
ponents and deepening potential wells at high redshift, we 
expect that, in any feedback-regulated model of black hole 
growth, black holes must become at least somewhat more 
massive (relative to their host spheroid stellar mass) at 
high redshift. We predict a similar mass-dependent evolu- 
tion in Mbh/A^*, with little or no significant evolution at 
Mf, < lO'^Mo (corresponding to black holes with masses < 
IO^Mq), and factor ^ 2 increase to z ^ 2 - 3 in the masses of 
the black holes in the most massive systems (M, ^ 10" Mq, 
corresponding to Mbh ^ IO'^Mq). Likewise, we predict the 
evolution in black hole mass relative to host velocity disper- 



sion, which exhibits a more complex behavior. The physi- 
cal de tails of driving these t rends are discussed in more de- 
tail in lHopkins et al.l (12007 b'). but here we embed those mod- 
els in a more fully cosmological context. The evolution 
predicted is consistent with recent observations (Pen g et aP 
2006) and with indirect constraints based on the relative evo- 
lution of black hole mass density and sph eroid mass density 
dHopkins et al.ll2006cl; iMerloni et al.ll2004. however i t is still 
relatively weak. Selection effects (iLauer et al.ll2007a) would 
be expected to explain the difference bet ween some observa- 
tional estimates that i nfer very strong ( e.g. lWalter et al.ll2004l) 
or rapid low -redshift (IWoo et al.ll2006l) evolution and our pre- 
dictions. 

We outline the history of these massive systems which are 
different when first formed at high redshift (where they are 
very dissipational) and observed at low redshift. Most of these 
compact, early forming ellipticals will undergo a significant 
number of dry mergers, or mergers with lower-redshift, more 
gas-depleted and larger disks, which will serve to puff up the 
remnant, building up some stellar mass and an extended stel- 
lar envelope (although changing the central velocity disper- 
sion by a relatively small amount). This raises their effective 
radii more rapidly than their stellar mass, in the sense of mov- 
ing them up in the size-mass relation. Interestingly, the evo- 
lution in disk gas fractions and the effects of such dry merg- 
ers almost exactly offset one another in massive systems. In 
other words, a ^ 1O'^M0 galaxy which first formed from a 
merger at z = 3 (which had a high gas content, leading to a 
very compact > lO^M© initial galaxy with Rg ^ I kpc) will 
have almost the same size at z = as a galaxy of the same 
observed stellar mass which formed at much later time, say 
z = 1, from a less dissipational merger (because disk gas frac- 
tions have decreased in that time interval). The system which 
formed earlier was initially more compact, but experienced 
more mergers in the intervening time, so for a given stellar 
mass, the scatter in Rg is not large. Likewise, velocities dis- 
persions and other fundamental plane properties have small 
scatter at a given stellar mass at any observed redshift because 
of this cancellation effect. 

This is not to say that most systems will undergo many 
dry mergers - the overall density of dry mergers predicted 
here is low, consi s tent with observation al constraints from e.g. 
iBell et al.1 (l2006l) : Ivan Dokkum (.20011) : [Lin et al.1 dlOOSh . and 
as we noted above most (especially < L*) ellipticals today 
formed in a few (or just one) gas-rich mergers at relatively 
low redshifts. However, the systems which form earliest do 
so because they live in the most dense environments, which 
evolve the most rapidly, and are expected to undergo the most 
subsequent merging activity. We predict, as a result, that most 
of these compact systems observed at high redshifts will end 
up constituting some of the central dense stellar mass in ob- 
served brightest cluster or group ga laxies at z = 0; this is con - 
sistent with the observed sizes (von der Linden et al.ll2007l) . 
surface brightness profile shapes (extended envelopes build up 
throug h their subsequent d ry merging) (see e.g. Paper III and 
Gallag her & Ostrikd[T97 2; Naab et al. 2006b), and masses 
and velocity dispersions ( Bernardi et al.i ,2007) of these z = 
systems, and also expected gi ven the cluster ing properties of 
high redshift massive systems (lOuadri et al.l l2007). 

We estimate what fraction, as a function of stellar mass, of 
the massive galaxy population today may have survived from 
these early times un-remerged and thus reflect the smaller 
sizes and higher dissipational content characteristic of sys- 
tems formed at that time, finding that e.g. ~ 1 -5% of z = 
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^ 1O"M0 systems may have survived since z^2. This is 
small, but not negligible, and by identifying compact ellipti- 
cals with old stellar population ages, it should be possible to 
locate the surviving remnants of this population. We predict 
that these will be massive, compact (with other indicators of 
gas-rich origin and high inferred dissipational fractions from 
their stellar profiles), with old stellar population ages. Be- 
cause they still generally would initially form in high-density 
environments, but have avoided merging since then, we would 
roughly expect them to be in dense environments (rich groups 
and clusters) today, but not to be the central galaxy (again, 
most early-forming systems will be the central galaxy - but if 
they are all the way until z = 0, it is unlikely that they will have 
avoided further mergers, and thus would not be un-remerged). 
There may be some observed candidate members of this popu- 
lation in the so-called "compact elliptical" class, galaxies sim- 
ilar to NGC 4486B in Virgo which appears to satisfy these cri- 
teria. Further observations to test this hypothesis could prove 
extremely valuable as nearby probes of high-redshift galaxy 
evolution. 

Future work should extend these models to include the (at 
present more uncertain) role of particularly minor mergers 
(where e.g. satellite disruption may be important to explain 
the extended halos in massive BCGs) and oth er sources of dis- 
sipation, such as stellar mass loss (see e.g. Ciotti & Ostrikeij 
|2007). However, we show that other effects, e.g. evolution in 
the sizes of disk galaxies, dry mergers alone, and evolution in 
galaxy host halo properties, are insufficient to explain effects 
like those predicted here, in particular to account for e.g. the 
tilt of the fundamental plane, the relative sizes of ellipticals 
(versus those of disks) as a function of mass, the scaling of el- 
liptical galaxy profile shapes with mass, and the redshift evo- 



lution in the fundamental plane correlations. Therefore, these 
predictions are important tests of any model in which dissipa- 
tional star formation is a significant influence on galaxy for- 
mation. 

Because dissipation is necessary in a basic sense to recon- 
cile the densities of ellipticals and spirals, this represents an 
important test of the merger hypothesis itself, as well as a 
test of our understanding of galaxy merger histories and the 
evolution in different galaxy components with redshift. In ad- 
dition, we provide a unified theoretical lens through which 
to interpret a number of observations of spheroid and black 
hole-host scalings at both z = and high redshift, in which 
spheroids are fundamentally multi-component objects (with 
dissipational components originally formed in central star- 
bursts triggered in the initial gas-rich interactions that formed 
the system, and dissipationless components representing the 
scattered, violently relaxed stars from the pre-initial merger 
stellar disks). As multi-component objects, it is primarily 
the relative mass fraction in the dissipational component, re- 
flecting the gas content of spheroid progenitors, that drives 
their structural properties and the evolution of those structural 
properties with redshift. 



We thank Tod Lauer, John Kormendy, Sadegh Khochfar, 
Marijn Franx, Ivo Labbe, Norm Murray, Chien Peng, and 
Barry Rothberg for helpful discussions and contributed data 
sets used in this paper. This work was supported in part by 
NSF grants ACI 96-19019, AST 00-71019, AST 02-06299, 
and AST 03-07690, and NASA ATP grants NAG5-12140, 
NAGS- 13292, and NAGS- 13381. Support for TJC and SW 
was provided by the W. M. Keck Foundation. 



REFERENCES 



Adelberger, K. L., & Steidel, C. C. 2005, ApJ, 627, LI 

Akiyama, M., Minowa, Y, Kobayashi, N., Ohta, K., Ando, M., & Iwata, I. 

2008, ApJS, 175, 1 
Aller, M. C, & Richstone, D. O. 2007, ApJ, 665, 120 
Almeida, C, Baugh, C. M., & Lacey, C. G. 2007, MNRAS, 376, 171 1 
Barden, M., et al. 2005, ApJ, 635, 959 
Barnes, J. E. 1992, ApJ, 393, 484 
Baugh, C. M., Lacey, C. G., Frenk, C. S., Granato, G. L., Silva, L., Bressan, 

A., Benson, A. J., & Cole, S. 2005, MNRAS, 356, 1191 
Bell, E. P., & de Jong, R. S. 2000, MNRAS, 312, 497 
Bell, E. P., Mcintosh, D. H., Katz, N., & Weinberg, M. D. 2003a, ApJ, 585, 

L117 
— . 2003b, ApJS, 149, 289 
Bell, E. P, et al. 2006, ApJ, 640, 241 
Bender, R. 1988, A&A, 193, L7 

Bender, R., Burstein, D., & Faber, S. M. 1992, ApJ, 399, 462 
Benson, A. J., Lacey, C. G., Frenk, C. S., Baugh, C. M., & Cole, S. 2004, 

MNRAS, 351, 1215 
Bernardi, M., Hyde, J. B., Sheth, R. K., Miller, C. J., & Nichol, R. C. 2007, 

AJ, 133, 1741 
Bernardi, M., et al. 2003, AJ, 125, 1866 
— . 2006, AJ, 131,2018 
Blanton, M. R. 2006, ApJ, 648, 268 
Blanton, M. R., Eisenstein, D., Hogg, D. W., Schlegel, D. J., & Brinkmann, 

J. 2005, ApJ, 629, 143 
Bolton, A. S., Buries, S., Treu, T., Koopmans, L. V. E., & Moustakas, L. A. 

2007, ApJ, 665, L105 
Bolton, A. S., Treu, T., Koopmans, L. V. E., Gavazzi, R., Moustakas, 

L. A., Buries, S., Schlegel, D. J., & Wayth, R. 2008, ApJ, in press 

arXiv:0805.1932 [astro-ph], 805 
Borch, A., et al. 2006, A&A, 453, 869 
Boumaud, P., Daddi, E., Elmegreen, B. G., Elmegreen, D. M., Nesvadba, N., 

Vanzella, E., di Matteo, R, Le Tiran, L., Lehnert, M., & Elbaz, D. 2008, 

A&A, 486, 741 
Boumaud, F, Jog, C. J., & Combes, P. 2005, A&A, 437, 69 



Bower, R. G., Benson, A. J., Malbon, R., Helly, J. C, Frenk, C. S., Baugh, 

C. M., Cole, S., & Lacey, C. G. 2006, MNRAS, 370, 645 
Boylan-Kolchin, M., Ma, C.-R, & Quataert, E. 2005, MNRAS, 362, 184 
— . 2006, MNRAS, 369, 1081 
Brown, M. J. I., Dey, A., Jannuzi, B. T., Brand, K., Benson, A. J., Brodwin, 

M., Croton, D. J., & Eisenhardt, R R. 2007, ApJ, 654, 858 
Brown, M. J. 1., et al. 2008, ApJ, in press, arXiv:0804.2293 [astro-ph], 804 
Bullock, J. S., Kolatt, T. S., Sigad, Y., Somerville, R. S., Kravtsov, A. V., 

Klypin, A. A., Piimack, J. R., & Dekel, A. 2001, MNRAS, 321, 559 
Bundy, K., Ellis, R. S., & Conselice, C. J. 2005, ApJ, 625, 621 
Bundy, K., et al. 2008, ApJ, 681, 931 
Buote, D. A., Gastaldello, P., Humphrey, P. J., Zappacosta, L., Bullock, J. S., 

Brighenti, P, & Mathews, W. G. 2007, ApJ, 664, 123 
Burkert, A., Naab, T., & Johansson, R H. 2007, ApJ, in press 

arXiv:0710.0663 [astro-ph], 710 
Caldwell, N., Rose, J. A., & Concannon, K. D. 2003, AJ, 125, 2891 
Calura, F, Jimenez, R., Panter, B., Matteucci, F, & Heavens, A. P. 2007, ApJ, 

in press ai-Xiv:0707.1345 [astro-ph], 707 
Cappellari, M., et al. 2006, MNRAS, 366, 1126 
Cattaneo, A., Dekel, A., Devriendt, J., Guiderdoni, B., & Blaizot, J. 2006, 

MNRAS, 370, 1651 
Ceveiino, D., & Klypin, A. 2007, ApJ, in press arXiv:0712.3285 [astro-ph], 

712 
Ciotti, L., Lanzoni, B., & Volonteri, M. 2007, ApJ, 658, 65 
Ciotd, L., & Ostriker, J. R 2007, ApJ, 665, 1038 
Comerford, J. M., & Natarajan, R 2007, MNRAS, 379, 190 
Conroy, C, & Wechsler, R. H. 2008, ApJ, in press, arXiv:0805.3346 [astro- 
ph], 805 
Conroy, C, Wechsler, R. H., & Ki'avtsov, A. V. 2006, ApJ, 647, 201 
Conroy, C, et al. 2007, ApJ, 654, 153 
Conselice, C. J., Bundy, K., Ellis, R. S., Brichmann, J., Vogt, N. R, & Phillips, 

A. C. 2005, ApJ, 628, 160 
Cooray, A. 2005, MNRAS, 364, 303 
— . 2006, MNRAS, 365, 842 
Covington, M., Dekel, A., Cox, T. J., Jonsson, P, & Piimack, J. R. 2008, 

MNRAS, 384, 94 



26 



Hopkins et al. 



Cox, T. J., Dutta, S. N., Di Matteo, T., Hernquist, L., Hopkins, P. R, 

Robertson, B., & Springel, V. 2006, ApJ, 650, 791 
Cox, T. J., et al. 2008, ApJ, in preparation 
Croft, R. A. C, Di Matteo, T., Springel, V., & Hernquist, L. 2008, MNRAS, 

in press, arXiv:0803.4003 [astro-ph], 803 
Croton, D. J., et al. 2006, MNRAS, 365, 1 1 
Daddi, E., et al. 2005, ApJ, 626, 680 

Damjanov, 1., et al. 2008, ApJ, in press, arXiv:0807.1744 [astro-ph], 807 
Dasyra, K. M., et al. 2006, ApJ, 638, 745 
de Lucia, G., & Blaizot, J. 2007, MNRAS, 375, 2 
De Lucia, G., Kauffmann, G., Springel, V., White, S. D. M., Lanzoni, B., 

Stoehr, R, Tormen, G., & Yoshida, N. 2004, MNRAS, 348, 333 
De Lucia, G., Springel, V., White, S. D. M., Croton, D., & Kauffmann, G. 

2006, MNRAS, 366, 499 
de Vaucouleurs, G. 1948, Annales d'Astrophysique, 11, 247 
Di Matteo, T., Colberg, J., Springel, V., Hernquist, L., & Sijacki, D. 2008, 

ApJ, 676, 33 
Di Matteo, T., Springel, V., & Hernquist, L. 2005, Nature, 433, 604 
di Serego Alighieri, S., et al. 2005, A&A, 442, 125 
D'Onghia, E., Burkert, A., Murante, G., & Khochfar, S. 2006, MNRAS, 372, 

1525 
Erb, D. K. 2008, ApJ, 674, 151 
Erb, D. K., Steidel, C. C, Shapley, A. E., Pettini, M., Reddy, N. A., & 

Adelberger, K. L. 2006, ApJ, 646, 107 
Faber, S. M., Tremaine, S., Ajhar, E. A., Byun, Y.-L, Dressier, A., Gebhardt, 

K., Grillmair, C, Kormendy, J., Lauer, T. R., & Richstone, D. 1997, AJ, 

114, 1771 
Fakhouri, O., & Ma, C.-R 2008, MNRAS, 386, 577 
Ferguson, H. C, et al. 2004, ApJ, 600, L107 
Ferrarese, L., & Merritt, D. 2000, ApJ, 539, L9 
Ferrarese, L., et al. 2006, ApJS, 164, 334 
Flores, H., Hammer, R, Puech, M., Amram, R, & Balkowski, C. 2006, A&A, 

455, 107 
Fontana, A., et al. 2006, A&A, 459, 745 
Franx, M., van Dokkum, P. G., Forster Schreiber, N. M., Wuyts, S., Labbe, 

I., Toft, S., et al. 2008, ApJ, in preparation 
Gallagher, 111, J. S., & Ostriker, J. P 1972, AJ, 77, 288 
Gallazzi, A., Chariot, S., Brinchmann, J., & White, S. D. M. 2006, MNRAS, 

370, 1106 
Gao, L., White, S. D. M., Jenkins, A., Stoehr, R, & Springel, V. 2004, 

MNRAS, 355, 819 
Gebhardt, K., et al. 2000, ApJ, 539, L13 
Genzel, R., et al. 2008, ArXiv e-prints, 807 
Govemato, F., Willman, B., Mayer, L., Brooks, A., Stinson, G., Valenzuela, 

O., Wadsley, J., & Quinn, T. 2007, MNRAS, 374, 1479 
Graham, A. W., & Driver, S. P 2007, ApJ, 655, 77 
Graham, A. W., Erwin, P, Caon, N., & Trujillo, 1. 2001, ApJ, 563, LI 1 
Granato, G. L., De Zotti, G., Silva, L., Bressan, A., & Danese, L. 2004, ApJ, 

600, 580 
Graves, J., et al. 2008, ApJ, in preparation 
Hiiring, N., & Rix, H.-W. 2004, ApJ, 604, L89 
Hashimoto, Y, Funato, Y, & Makino, J. 2003, ApJ, 582, 196 
Hausman, M. A., & Ostriker, J. P 1978, ApJ, 224, 320 
Hernquist, L. 1989, Nature, 340, 687 
— . 1990, ApJ, 356, 359 

Hernquist, L., & Mihos, J. C. 1995, ApJ, 448, 41 
Hernquist, L., Spergel, D. N., & Heyl, J. S. 1993, ApJ, 416, 415 
Heymans, C, et al. 2006, MNRAS, 371, L60 

Hopkins, P F, Bundy, K., Hernquist, L., & Ellis, R. S. 2007a, ApJ, 659, 976 
Hopkins, P. F., Cox, T. J., Dutta, S. N., Hernquist, L., Kormendy, J., & Lauer, 

T. R. 2008a, ApJ, accepted, arXiv:0805.3533 [astro-ph], 805 
Hopkins, P R, Cox, T. J., & Hernquist, L. 2()08b, ApJ, accepted, 

arXiv:0806.3974 [astro-ph], 806 
Hopkins, P R, Cox, T. J., Keres, D., & Hernquist, L. 2008c, ApJS, 175, 390 
Hopkins, P R, Cox, T. J., Younger, J. D., & Hernquist, L. 2008d, ApJ, 

accepted, arXiv:0806.1739 [astro-ph], 806 
Hopkins, P R, & Hernquist, L. 2006, ApJS, 166, 1 
Hopkins, P. R, Hernquist, L., Cox, T. J., Di Matteo, T., Martini, P., Robertson, 

B., & Springel, V. 2005a, ApJ, 630, 705 
Hopkins, P R, Hernquist, L., Cox, T. J., Dutta, S. N., & Rothberg, B. 2008e, 

ApJ, 679, 156 
Hopkins, P R, Hernquist, L., Cox, T. J., & Keres, D. 2008f, ApJS, 175, 356 
Hopkins, P. F, Hernquist, L., Cox, T. J., Robertson, B., Di Matteo, T., & 

Springel, V. 2006a, ApJ, 639, 700 
Hopkins, P. F, Hernquist, L., Cox, T. J., Robertson, B., & Krause, E. 2007b, 

ApJ, 669, 45 
— . 2007c, ApJ, 669, 67 



Hopkins, P F, Hernquist, L., Cox, T. J., Robertson, B., & Springel, V. 2006b, 

ApJS, 163, 50 
Hopkins, P F, Hernquist, L., Cox, T. J., Younger, J. D., & Besla, G. 2008g, 

ApJ, accepted, arXiv:0806.2861 [astro-ph], 806 
Hopkins, P. R, Hernquist, L., Martini, P., Cox, T. J., Robertson, B., Di Matteo, 

T., & Springel, V. 2005b, ApJ, 625, L71 
Hopkins, P. R, Lauer, T. R., Cox, T. J., Hernquist, L., & Kormendy, J. 2008h, 

ApJ, in press, arXiv:0806.2325 [astro-ph], 806 
Hopkins, P. P., Lidz, A., Hernquist, L., Coil, A. L., Myers, A. D., Cox, T. J., 

& Spergel, D. N. 2007d, ApJ, 662, 1 10 
Hopkins, P F, Narayan, R., & Hernquist, L. 2006c, ApJ, 643, 641 
Johansson, P. H., Naab, T., & Burkert, A. 2008, ApJ, in press 

arXiv:0802.0210 [astro-ph], 802 
Kannappan, S. J. 2004, ApJ, 611, L89 
Kassin, S. A., et al. 2007, ApJ, 660, L35 
Kazantzidis, S., Bullock, J. S., Zentner, A. R., Kravtsov, A. V., & Moustakas, 

L. A. 2007, ApJ, in press, arXiv:0708.1949 [astro-ph], 708 
Kennicutt, Jr., R. C. 1998, ApJ, 498, 541 
Keres, D., et al. 2007, ApJ, in preparation 

Keres, D., Katz, N., Weinberg, D. H., & Dave, R. 2005, MNRAS, 363, 2 
Khochfar, S., & Silk, J. 2006, ApJ, 648, L21 
Kobayashi, C. 2004, MNRAS, 347, 740 

Komatsu, E., et al. 2008, ApJ, in press, arXiv:0803.0547 [astro-ph], 803 
Kormendy, J., Fisher, D. B., Cornell, M. E., & Bender, R. 2008, ApJ, in press 
Kormendy, J., & Richstone, D. 1995, ARA&A, 33, 581 
Kravtsov, A. V., Berlind, A. A., Wechsler, R. H., Klypin, A. A., Gottlober, 

S., Allgood, B., & Primack, J. R. 2004, ApJ, 609, 35 
Kriek, M., et al. 2006, ApJ, 649, L71 

Krivitsky, D. S., & Kontorovich, V. M. 1997, A&A, 327, 921 
Labbe, 1., et al. 2005, ApJ, 624, L81 
Lauer, T. R., Tremaine, S., Richstone, D., & Faber, S. M. 2007a, ApJ, 670, 

249 
Lauer, T R., et al. 2007b, ApJ, 664, 226 
Lidz, A., Hopkins, P F, Cox, T. J., Hernquist, L., & Robertson, B. 2006, ApJ, 

641,41 
Lin, L., et al. 2008, ApJ, in press, arXiv:0802.3004 [astro-ph], 802 
Magorrian, J., et al. 1998, AJ, 115, 2285 
Makino, J., & Hut, P 1997, ApJ, 481, 83 
Mailer, A. H., Katz, N., Keres, D., Dave, R., & Weinberg, D. H. 2006, ApJ, 

647, 763 
Mamon, G. A. 2006, in Groups of Galaxies in the Nearby Universe, ed. 

I. Saviane, V. Ivanov, & J. Borissova 
Marconi, A., & Hunt, L. K. 2003, ApJ, 589, L21 
Masjedi, M., Hogg, D. W., & Blanton, M. R. 2007, ApJ, in press, 

arXiv0708.3240 [astro-ph]:, 708 
Masjedi, M., et al. 2006, ApJ, 644, 54 
McGaugh, S. S. 2005, ApJ, 632, 859 
McGaugh, S. S., Schombert, J. M., Bothun, G. D., & de Blok, W. J. G. 2000, 

ApJ, 533, L99 
Mcintosh, D. H., et al. 2005, ApJ, 632, 191 

Merloni, A., Rudnick, G., & Di Matteo, T. 2004, MNRAS, 354, L37 
Merritt, D., & Ferrarese, L. 2001, ApJ, 547, 140 
Mo, H. J., Mao, S., & White, S. D. M. 1998, MNRAS, 295, 319 
Monaco, P, & Fontanot, R 2005, MNRAS, 359, 283 
Murray, N., Quataert, E., & Thompson, T. A. 2005, ApJ, 618, 569 
Naab, T., Jesseit, R., & Burkert, A. 2006a, MNRAS, 372, 839 
Naab, T., Johansson, P H., Ostriker, J. P, & Efstathiou, G. 2007, ApJ, 658, 

710 
Naab, T., Khochfar, S., & Burkert, A. 2006b, ApJ, 636, L81 
Naab, T., & Trujillo, 1. 2006, MNRAS, 369, 625 
Navarro, J. R, Rrenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563 
Neistein, E., van den Bosch, R C, & Dekel, A. 2006, MNRAS, 372, 933 
Nelan, J. E., et al. 2005, ApJ, 632, 137 
Neto, A. R, et al. 2007, MNRAS, 381, 1450 
Noeske, K. G., et al. 2007, ApJ, 660, L47 
Nurmi, P., Heinamaki, P., Saar, E., Einasto, M., Holopainen, J., Martinez, 

V. J., & Einasto, J. 2006, A&A, in press [astro-ph/061 1941] 
Peng, C. Y, Impey, C. D., Rix, H.-W., Kochanek, C. S., Keeton, C. R., Falco, 

E. E., Lehar, J., & McLeod, B. A. 2006, ApJ, 649, 616 
Perez-Gonzalez, P G., et al. 2008, ApJ, 675, 234 

Puech, M., Hammer, R, Lehnert, M. D., & Flores, H. 2007, A&A, 466, 83 
Quadri, R., et al. 2007, ApJ, 654, 138 
Ravindranath, S., et al. 2004, ApJ, 604, L9 
Reddy, N. A., Steidel, C. C, Fadda, D., Yan, L., Pettini, M., Shapley, A. E., 

Erb, D. K., & Adelberger, K. L. 2006, ApJ, 644, 792 
Rix, H.-W., et al. 2004, ApJS, 152, 163 
Robertson, B., Cox, T J., Hernquist, L., Franx, M., Hopkins, P. R, Martini, 

P, & Springel, V. 2006, ApJ, 641, 21 



Evolution in Spheroid Scalings 



27 



Robertson, B., Yoshida, N., Springel, V., & Hemquist, L. 2004, ApJ, 606, 32 

Rothberg, B., & Joseph, R. D. 2004, AJ, 128, 2098 

Salpeter, E. E. 1964, ApJ, 140, 796 

Salviander, S., Shields, G. A., Gebhardt, K., & Bonning, E. W. 2006, New 

Astronomy Review, 50, 803 
Scannapieco, E., & Oh, S. R 2004, ApJ, 608, 62 
Sersic, J. L. 1968, Atlas de galaxias australes (Cordoba, Argentina: 

Observatorio Astronomico, 1968) 
Shapiro, K. L., et al. 2008, ApJ, 682, 231 

Shapley, A. E., Coil, A. L., Ma, C.-R, & Bundy, K. 2005, ApJ, 635, 1006 
Shen, S., Mo, H. J., White, S. D. M., Blanton, M. R., Kauffmann, G., Voges, 

W., Brinkmann, J., & Csabai, 1. 2003, MNRAS, 343, 978 
Sheth, R. K., Mo, H. J., & Tormen, G. 2001, MNRAS, 323, 1 
Shields, G. A., Menezes, K. L., Massart, C. A., & Vanden Bout, R 2006, ApJ, 

641, 683 
Sijacki, D., Springel, V., di Matteo, T., & Hemquist, L. 2007, MNRAS, 380, 

877 
Silk, J., & Rees, M. J. 1998, A&A, 331, LI 
Somerville, R. S., Hopkins, P. R, Cox, T. J., Robertson, B. E., & Hemquist, 

L. 2008a, MNRAS, in press, ai-Xiv:0808.1227 [astro-ph] 
Somerville, R. S., Primack, J. R., & Faber, S. M. 2001, MNRAS, 320, 504 
Somerville, R. S., et al. 2008b, ApJ, 672, 776 
Spergel, D. N., et al. 2003, ApJS, 148, 175 
— . 2007, ApJS, 170, 377 

Springel, V., Di Matteo, T., & Hemquist, L. 2005, MNRAS, 361, 776 
Springel, V., White, S. D. M., Tormen, G., & Kauffmann, G. 2001, MNRAS, 

328, 726 
Stewart, K. R., Bullock, J. S., Wechsler, R. H., Mailer, A. H., & Zentner, 

A. R. 2007, ApJ, in press arXiv:07 11.5027 [astro-ph], 711 
Thomas, D., Maraston, C, Bender, R., & Mendes de Oliveira, C. 2005, ApJ, 

621, 673 
Tormen, G., Moscardini, L., & Yoshida, N. 2004, MNRAS, 350, 1397 
Trager, S. C, Faber, S. M., Worthey, G., & Gonzalez, J. J. 2000, AJ, 119, 

1645 
Tremaine, S., et al. 2002, ApJ, 574, 740 
Treu, T, et al. 2005, ApJ, 633, 174 
Trujillo, I., et al. 2006, ApJ, 650, 18 
Vale, A., & Ostriker, J. P 2006, MNRAS, 371, 1 173 

van den Bosch, F C, Tomien, G., & Giocoli, C. 2005, MNRAS, 359, 1029 
van den Bosch, F. C, et al. 2007, MNRAS, 376, 841 



van der Wei, A., Franx, M., van Dokkum, P. G., Rix, H.-W., lllingworth, 

G. D., & Rosati, P 2005, ApJ, 631, 145 
van Dokkum, P., Franx, M., Kriek, M., Holden, B., lllingworth, G., Magee, 

D., Bouwens, R., Marchesini, D., Quadri, R., Rudnick, G., Taylor, E., & 

Toft, S. 2008, ArXiv e-prints, 802 
van Dokkum, P G. 2005, AJ, 130, 2647 
— . 2008, ApJ, 674, 29 

van Dokkum, P G., & van der Marel, R. P 2007, ApJ, 655, 30 
van Dokkum, P G., et al. 2004, ApJ, 61 1, 703 
— . 2006, ApJ, 638, L59 
Volonteri, M. 2007, ApJ, 663, L5 

Volonteri, M., Salvaterta, R., & Haardt, R 2006, MNRAS, 373, 121 
von der Linden, A., Best, P N., Kauffmann, G., & White, S. D. M. 2007, 

MNRAS, 379, 867 
Walter, F, Carilli, C, Bertoldi, F, Menten, K., Cox, P, Lo, K. Y, Fan, X., & 

Strauss, M. A. 2004, ApJ, 615, L17 
Wang, L., Li, C, Kauffmann, G., & de Lucia, G. 2006, MNRAS, 371, 537 
Wechsler, R. H., Bullock, J. S., Primack, J. R., Kravtsov, A. V., & Dekel, A. 

2002, ApJ, 568, 52 
Wechsler, R. H., Zentner, A. R., Bullock, J. S., Kravtsov, A. V., & Allgood, 

B. 2006, ApJ, 652, 71 
White, S. D. M. 1976, MNRAS, 174, 467 

Woo, J.-H., Treu, T., Malkan, M. A., & Blandford, R. D. 2006, ApJ, 645, 900 
Woods, D. R, Geller, M. J., & Barton, E. J. 2006, AJ, 132, 197 
Wuyts, S., et al. 2007, ApJ, 655, 51 
Wyithe, J. S. B., & Loeb, A. 2005, ApJ, 621, 95 
Yan, R., Madgwick, D. S., & White, M. 2003, ApJ, 598, 848 
Yang, X., Mo, H. J., & van den Bosch, R C. 2003, MNRAS, 339, 1057 
Younger, J. D., Hopkins, P. F., Cox, T. J., & Hemquist, L. 2008, ApJ, in press, 

arXiv:0804.2672 [astro-ph], 804 
Zavala, J., Okamoto, T., & Frenk, C. S. 2007, MNRAS, in press 

arXiv:0710.2901 [astro-ph], 710 
Zentner, A. R., Berlind, A. A., Bullock, J. S., Kravtsov, A. V., & Wechsler, 

R. H. 2005, ApJ, 624, 505 
Zheng, Z., Coil, A. L., & Zehavi, 1. 2007, ApJ, 667, 760 
Zheng, Z., et al. 2005, ApJ, 633, 791 
Zirm, A. W., et al. 2007, ApJ, 656, 66 



APPENDIX 
IMPLEMENTATION OF THE HALO OCCUPATION MODEL 

Here, we present a condensed outline of the simplest implementation of the model used to make the predictions in this paper As 
described in the text (§|2|i, we have experimented with a wide variety of modifications to the model elements and methodology 
- but our intention here is to summarize the basic framework upon which t hese modifications rep resent increasing layers of 
complexity. We therefore leave the description of these experiments to §l2land Hopkins et alj (l2008il) . 

As outlined in § 12. 11 our methodology consists of a few key steps: 

We construct a Monte Carlo sample of halos at some observed redshift Zobs, sa mpling according to the halo mass function at 
that redshift (constructed in standard fashion for the adopted cosmology following' Sheth et al.ll2001h . 

For each halo, we determine a mock growth history, i.e. Mhaio(z) for all z > Zobs up to some initial (maximum) redshift Zmit- 
There are several ways to do this: u sing the extended Press-Schechter f ormalism, adopting fits to individual halo growth his- 
tories in simulations (taken from e.g. iDe Lucia et al.ll2006t IStewart et al.l(2007,) or directly tracking the main-branch progenitor 
halo mass of each z = halo in the simulations, integrating over analytic fits to mean merger histories from initial seed popula- 
tions (Fakhouri & Ma 2008), or adopting the mean Mhaio(z|A^haio[z = Zobs]) for a given z = Zobs population of halos fitted in e.g. 
IWechsler et al. (2002, 2006); Neistein et al. (2006). Because we are considering the population in a statistical fashion, it makes 
no difference which of these approaches we adopt so long as they yield a similar median (and the scatter in galaxy properties in 
a given halo is, in any case, larger than the scatter at different times between these methodologies). The simplest approach, then, 
is to consider eac h halo in the Monte Carlo sample to have a mass at each z > Zobs given by the mean growth history (adopting 
the analytic fits in Neistein et al. 2006, in their Equation 1 1 and Appendices). 

Now we have an average Mhaio(z) for each halo in our representative Monte Carlo sample. We then begin at some initial redshift 
Zinitiai and cvolvc the system forward in timesteps of some Az.^ We ignore halos until they reach a mass (our effective resolution 
limit, corresponding to resolution limits in many of the simulations on which the HOD calculations are based) of Mhaio = 1O'"M0. 
This corresponds, given a typical HOD, to an extremely small stellar mass M, < 10^ Mq, so is irrelevant for the final mass of all 
the galaxies considered here, and is a source of little uncertainty. 

At each timestep, all galaxies that have not yet experienced a major merger above the resolution limit are initialized according to 
the halo occupation model (specifically, we initialize the galaxy stellar mass M»(Mhaio) and then determine R^iM^,) and /gas(A^*))- 
As described in the text, we have experimented with a variety of observational constraints regarding the implementation of 



* Here, we choose ziniiial = 6 and Az = 0.01, but in general we find that our results converge with respect to Az for values Az ^0.1, and t he predictions at any 
given Zobs converge rapidly once Zinitial is larger by some difference zinitiai —Zobs ^1—2. The reasons for this are discussed in the text (§ 12.3) and below. 



28 Hopkins et al. 

the HOD (se e JHopkinset al.ll2008il for a more detailed comparison). It is possible, for example, to use the quoted fits from 
IConroy et alJ (l2006l) at each of several redshifts, and linearly interpolate between ea ch redshift where the H OD was fitted to 
apply it to our model. It is also possible to use the methodology in that paper and in ' Vale & Ostrike j (12006') - monotonically 
ranking galaxies in stellar mass and halos in either mass or circular velocity, and assigning them to one another in one-to-one 
correlation - togeth er with an analytic redshift-dependent fit to the galaxy stellar mass function (extending to high redshift), 
such as that in Fontana et al.l (i2006) . to obtain the HOD at each redshift. In pra ctice, all of the s e applications yield similar 
results - these observational comparisons and other direct rneasur ements (see e.g. lYan et alj|2003l ICooravll2005l : IConrov et al] 
l2006t iHevmans et al.ll2006t IConrov et al.ll2007t iBrown et al.ll2008l) imply that the evolution in M*(Mhaio) with redshift is weak 
(and only significant at the highest masses, where number densities drop sufficiently rapidly at high redshift as to make them a 
negligible contribution to the lower-redshift population). For this reason, we obtain nearly identical results using the simplest 
possible prescription: assuming M*(Mhaio) is redshift-independent and applying the obs erved z = relation, assign ing each halo 
galaxy a stellar mass in Monte Carlo fashion (compare t he redshift-depend ent fits in IConroy & Wechsled 120081 who reach a 
similar conclusion). Specifically, we consider the fits from lWang et alJ (l2006l) . for central galaxies 



M^=M 



(Mh,io/M„)-" + (Mi,aio/M„y 



-/3 



(Al) 



with (Mo, Ml, a, (3) = (3.16 x IO^/j-'Mq, 4.48 x lO'^M©, 0.39, 1.96) and a lognormal scatter with a = 0.148 dex dispersion at 
eachMhaio ^■ 

We then assign each disk an effective radius Re(M^ \z) according to the methodology in § 12.31 where Re(M^ |z = 0) ~ 
3.4(Mdisk/10"'Mo)°''kpc from IShen et al.l (|2003|) (converted to our adopted cosmology and stellar IMF) and Re(M^\z) = 
(1 +zT^'' ReiM^, |z = 0) as per Equation Q (and we consider both /3,; = - no evolution - and /3^ = 0.4 - the evolution sug- 
gested by observations - showing the difference between the two in the range of uncertainties in our model). The gas fractions 
are assigned in similar fashion, with the z = observed relation in Equation dU and appropriate redshift evolution in Equation (|2|i 
(where we again consider a range to allow for observational uncertainties as described in the text, specifically /3 = 0.5-2.0 in 
Equation|2]i. 

These assignments are re-initialized at each redshift, if the halo remains un-merged. For this reason (and because at any 
redshift, most galaxies of a given mass have assembled in the last redshift interval Az ~ 1 - 2), the large uncertainties at high 
redshift are quickly suppressed at any lower redshift. For example, even though the uncertainty in R^ of a typical disk at high 
redshift (e.g. z = 4) is large (say, 0.5 dex), and this will enter as such in the uncertainty in predicted size of an elliptical forming at 
this redshift (although the uncertainty is somewhat suppressed since the system is likely to be gas-rich; see the discussion in §|2l), 
by z = 0, the original uncertainty will be suppressed by a typical factor ^ 16-32 (depending on how much the galaxy has grown 
by low-redshift mergers), introducing only ~ 0.02 -0.03 dex uncertainty in the low -redshift prediction (in other words, at lower 
redshifts, the uncertainties are dominated by the uncertainties in the HOD around those redshifts, rather than by the propagation 
of uncertainties from higher redshifts). 

The critical step at each redshift is then determining whether or not the galaxy experiences a merger. Because we are interested 
in galaxy-galaxy mergers, rather than e.g. a subhalo merging into the parent halo (and the two can be very different, as many 
subhalos, especially those which are small relative to the parent halo, may not merge for a Hubble time), we do not wish to 
adopt an extended Press-Schechter or simulation-based merger tree, but opt for a more sophi sticated approach. At each timestep, 
we consider the hal o to have a subhalo popula tion according to that fitted in simulations ( Kravtsov et alJl2004) (alternatively, 
following the fits in Ivan den Bosch et al J 120051 who adopt a semi-analytic methodology but reach similar conclusions) - this 
subhalo mass function is, in units of Msubhaio/A^haio, only weakly halo mass or redshift-dependent, and it makes no difference 
whether we adopt th e fits from dif ferent authors (we ha ve also com pared the subhalo mass fu nctions in Springel et al. 20011 
iTormen et a l. 2004; De Lucia et alJ i2004; Gao et al. 20041; IZentner et a l. 2005; Nurmi et al.ll2006L and reach similar conclusions). 
We then calculate the time for each such subhalo and its contained galaxy to merge with the central galaxy, according to one 
of several methodologies: either assuming a simple dynamical friction timescale, employing a more sophisticated timescale 
based on the cross section for resonant galaxy-galaxy interaction and orbital capture, or a timescale calculated based on a similar 
calculation in angular momentum space; the probability of a merger with each subsystem is then take n as the ratio of the tim estep 
to the merger timescale. We compare these methodologies in detail in iHopkins et al.l (l2008il) and iHopkins et alj (l2008c ) and 
discuss the details of the derivation for each. For our purposes here, as demonstrated in those papers, they give similar results, 
and we note that the results are also similar to what is commonly adopted in many semi-analytic models based on N-body 
simulations, which follow subhalos until they can no longer be resolved and then approximate the remaining merger timescale 
with a dynamical fr iction timescale estimate. 

In IHopkins et al.l (|2008f ) we show the detailed PDFs for the merger rate as a function of halo mass, galaxy mass, and red- 
shift (see specifically Figures 2-4 therein), that arise from various combinations of these assumptions. We show there that the 
different approac hes agree reasonab ly well, and agree with independent estimates such as sufficiently high resolution N-body 
experiments (e.g. 'Mailer et al.ll2006l ) and semi-analytic models with similar merger timescale calibrations from numerical tests 
(ISomerville et al. 2008a). For our purposes here, the important thing is that Hopkins et al. (2008f ) show that the model predictions 
agree well with the observed mass function of galaxy mergers at all redshifts observed; as well as the observed galaxy merger rate 
as a function of environmental density, halo mass (estimated from clustering or group dynamics), and redshift; and the clustering 

' The "turnover" in M, (Mi,j(,) in Equation jAU mainly corresponds to quenclied, spheroid galaxies, and so one could argue should not be applied to a sample 
of strictly un-merged galaxies. We have experimented with not including this turnover, i.e. adopting the low-mass slope of the HOD with M, oc A/jJ^'^, with a 
maximum at a stellar mass equal to the halo mass times the universal baryon fraction. Because very few systems survive to such large masses without merging, 
however, this makes no difference to our predictions. 
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(both large-scale and small-scale) and therefore halo occupation of mergers and recent merger remnants (see their Figures 6-12). 
This gives us some confidence that the calculation yiel ds a reasonabl e appro ximation to the merger rate. For the details and 
summary of the adopted calculation, we refer to § 2.1 of lHopkins et alj (1200811) . For our fiducial model i n this paper, we emplo y 
their "default" model: we construct the subhalo mass function for each halo adopting the analytic fits in iKravtsov et al.l ( |2004|) . 
We populate each subhalo as if it were a random member of our halo population (of the same mass) - specifically, we draw a 
random halo of the same mass from our Monte Carlo sample and assign the subhalo the same properties'^. We then estimate the 
merger timescale using the "gr oup capture" gravitational cross-section timescales, specifically using the fitting formulae from 
iKrivitsk v & Kontorovich' (4997) calibrated to a large set of numerical simulations of different encounters in group environments 
(see also White 1976; Makino & Hut 1997; Mamon 2006). This ratio of the timestep to this timescale gives the probability of 
each merger, and we determine whether each occurs in Monte Carlo fashion. 

When a merger occurs above the cutoff mass ratio threshold (a "major" merger), the properties of the remnant are determined 
as a function of the properties of both progenitors, as described in § 12.41 The final stellar mass is the sum of the two progenitor 
baryonic masses; the two constituent components, the dissipational and dissipationless stellar mass, of the remnant are the sum of 
the dissipational and dissipationless masses of the progenitors. For un-merged progenitors, the dissipational and dissipationless 
mass are the pre-merger gas and stellar mass, respectively. The effective radii of the remnants are calculated as described in the 
text, for the dissipationless component as a function of the mass ratio and scale radii of the progenitors (Equation|5]), and then with 
the appropriate correction applied for the dissipational mass fraction (Equation|6l). Other properties follow as describ ed in §12.41 
Properties such as the dark matter fraction within a given radius (for dynamical masses, etc) are calcu lated assuming a lHernquisa 
(11990) profile for the dark matter halo, with a redshift-dependent concentration from iBullock et all (|2001), and projecting the 
profile along with that of the galaxy. 

The stellar mass and stellar mass profile of major merger remnants are taken to be those calculated at the previous merger, 
although the halo grows continuously with time. In other words, major merger remnants are effectively "quenched" in some sense 
(although new gas can and does come into the galaxy through subsequent mergers with gas-rich galaxies) - we ignore details of 
the re-growth of disks around early-forming ellipticals through direct cosmological accretion of cold gas. Although this may be 
important for especially the most high-redshift systems, a proper treatment requires a much more complete cosmological model 
(ideally highly resolved cosmological simulations that can form proper low-redshift disks, still a major theoretical challenge). 
Moreover, it has been argued t hat feedback from quasar and starburst activity in major mergers rnay ac t ually be an iniportant 
physical ag ent of "quenching." (TScannap ieco & Ohl l2004tlGranato et"ani2004l;lMonaco & Fontanotil2005l : lHopkins et"aLll2006bt 
|2007a d; N aab et aLll2006bl: iBundv et al. 2008). in which case this is a more accurate assumption than allowing for new cooling. 

In any event, we show in Hopki ns et al.l (l2008c) that such a prescription, with a major merger mass ratio threshold set to the 
value used in this paper (1:3), yields very good agreement with the observed mass function and mass density of quenched galaxies 
as a function of redshift, as well as quenched central galaxy fraction as a bivariate function of stellar and halo mass, the clustering 
of red galaxies as a function of mass and redshift, the distribution of stellar population ages (and implicitly, "quenching times") 
as a function of elliptical stellar mass, and the distribution of elliptical structural properties related to cooling. This argues that 
the current prescription is at least a good approximation to the actual cooling histories of the galaxies of interest here. We have 
also experimented with alternative prescriptions such as those found in semi-analytic models ( with halo cooling suppressed by 
low-level AGN feedback, or truncated above a certa i n halo or stellar mass th reshold, see e.g. ICroton etliLl 120061: iBower et aH 
120061: ICattaneo et al.ll2006t Ide Lucia & Blaizoll2007l: ISomerviUe et al.ll2008al) . and find similar results (this is because, it turns 
out, the halo mass threshold for the transition to "hot mode" accretion and effective quenching in these models, and empirically 
constrained stellar/halo mass ranges for quenching, correspond quite closely with the regime where most galaxies experience 
their first major merger; see e.g. Cattaneo et al. 2006). 

This process is repeated until the desired Zobs is reached. We then discard the systems that remain un-merged (as these will not, 
predominantly, be elliptical galaxies) and construct the mock observed sample of interest from our Monte Carlo population. 



So long as the Monte Carlo sample is sufficiently lai'ge, this is identical to fi'acing the growth and merger history of the subhalo itself. As discussed in the 
text, we have explicitly chosen to ignore effects unique to satellites such as ram-pressure stripping. Although these are important for low-mass satellites which 
are long-lived in e.g. massive groups and clusters, they do not generally apply to the cases of interest: namely major mergers, for which the merger times are short 
(too short for long-timescale effects such as stripping and harassment) and the halos are of comparable mass (making e.g. ram-pressure relatively unimportant). 
Moreover, for the typical < a few L, galaxies of interest here, the "subhalo" phase is very short in major mergers - it represents a brief intermediate merging 
stage between field or small/loose group galaxies on their way to merging. For more massive cluster systems, however, the caveat should apply, along with the 
others discussed in §|5] 



